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Title : Novel Hematopoietic Genes and Polypeptides 
FIELD OF THE INVENTION 

The invention relates to novel genes primarily expressed in hematopoietic cells, polypeptides 
encoded by the novel genes and truncations, analogs, homologs, and isoforms of the polypeptides; and, uses 
5 of the polypeptides and genes. 

BACKGROUND OF THE INVENTION 

Many of the molecular events involved in the embryonic development and differentiation of 
hematopoietic cells remain to be elucidated. In the mouse, several genetic approaches have been employed 
in an attempt to identify and determine the function of novel hematopoietic genes. One approach involves 

10 the cloning of mutant loci responsible for altered hematopoietic phenotypes in mice harboring naturally 

occurring or induced mutations [1,2,3]. A complementary approach involves the generation of new 
mutations in the mouse germline by homologous recombination in embryonic stem (ES) cells to mutate 
candidate hematopoietic genes [4,5,6,7,8,9,10]. Both approaches are labour intensive, the former being 
restricted to the repetoire of mutant phenotypes while the latter is restricted to previously identified genes, 

15 Thus, neither approach is easily applied on a genome-wide level. In contrast, gene trapping in ES cells 

provides an effective approach to screen and analyze novel genes because it simultaneously provides 
information on the pattern of expression in vivo, sequence and resulting mouse mutant phenotype 
[11,12,13.14,15,16,17,18]. 

One potential drawback of gene trapping is the very large size of the mammalian genome and 

2 0 hence the integration of the gene trap vector into genes diat may not be of immediate interest. To p re-select 

for genes that are preferentially expressed in hematopoietic cells, an expression trapping strategy has been 
developed that takes advantage of the developmental potential of ES cells to differentiate into diverse cell 
lineages in vitro, including hematopoietic cells [19,20,21,22,23,24,25]. Upon the removal of leukemia 
inhibitory factor (LIF), ES cells spontaneously differentiate into embryoid bodies (EBs), structures which 

2 5 resemble postimplantation embryos, comprising a number of cell lineages and structures, including blood 

islands which contain hematopoietic precursor cells. Secondary plating of cells obtained from dissociated 
EBs in semisolid cultures, containing various cytokines and growth factors, results in the growth of colonies 
of erydiroid, myeloid and lymphoid lineages [19,24,25,26,27,28]. Recently, in vitro differentiation of ES 
cells has been combined with gene trapping strategies to pre-screen for novel genes which respond to 

3 0 exogenous factors [29,17] or are regulated in a tissue-specific manner prior to transmission into the mouse 

germline [18, 30,31,32]. In order to screen large numbers of cell clones based on the expression of trapped 
genes in hematopoietic and endothelial cells, an expression trap screen has been combined with an attached 
EB culture system to identify genes expressed within blood islands and endothelial cells [18], Muth et 
a/.,(1998) [32] have developed a related approach to identify genes that are repressed during hematopoietic 
3 5 differentiation. 

SUMMARY OF THE INVENTION 

The ES cell-0P9 co-culture system has been combined with gene trapping to screen for genes 
involved in the development and differentiation of hematopoietic cells. The OP9 stromal cell line induces 
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ES cell differentiation into mesodermal colonies that, when replated, differentiate into single lineage 
precursors, without the addition of exogenous growth factors [33]. Using this system, the present inventors 
have identified two novel genes designated Hzf (hematopoietic zinc finger) and Hlil (denoting the 
embryonic expression seen within hematopoietic cells, heart and liver) which are expressed in a regulated 
5 pattern during hematopoietic differentiation in vitro. These genes exhibit hemalopoietic-specific expression 

in vivo, indicating that the in vitro pre-screening strategy successfully predicts the expression of trapped 
genes in vivo. Within the hematopoietic compartment, both genes were predominantly expressed in 
megakaryocytes. 

Broadly stated the present invention contemplates an isolated hematopoietic nucleic acid molecule 
1 0 encoding a Hzf or Hhl polypeptide of the invention, including mRN As. DNAs, cDNAs, genomic DN As, 

PNAs. as well as antisense analogs and biologically, diagnostically, prophyiactically, clinically or 
therapeutically useful variants or fragments thereof, and compositions comprising same. 

The invention also contemplates isolated hematopoietic Hzf or Hzl polypeptides encoded by a 
nucleic acid molecule of the invention a truncation, an analog, an allelic or species variation thereof, or a 
1 5 homolog of a polypeptide of the invention or a truncation thereof, (Truncations, analogs, allelic or species 

variations, and homologs are collectively referred to herein as "Hzf Related Polypeptides" or "Hhl Related 
Polypeptides). 

The nucleic acid molecules of the invention permit identification of untranslated nucleic acid 
sequences or regulatory sequences that specifically promote expression of genes operatively linked to the 
2 0 promoter regions. Identification and use of such promoter sequences are particularly desirable in instances, 

such as gene transfer or gene therapy, which may specifically require heterologous gene expression in a 
limited environment e.g hematopoietic system. The invention therefore contemplates a nucleic acid 
molecule comprising a non-coding sequence such as a and/or 3'* sequence, preferably a non-coding 
sequence of Hifor Hhl. 

2 5 The nucleic acid molecules which encode for the mature Hzf or Hhl polypeptide (may include 

only the coding sequence for the mature polypeptide; the coding sequence for the mature polypeptide and 
additional coding sequences (e.g. leader or secretory sequences, propolypeptide sequences); the coding 
sequence for the mature polypeptide (and optionally additional coding sequence) and non-coding sequence, 
such as introns or non-coding sequence 5' and/or 3* of the coding sequence of the mature polypeptide. 
^ 0 Therefore, the term **nucleic acid molecule encoding a polypeptide'* encompasses a nuc leic acid 

molecule which includes only coding sequence for the polypeptide as well as a nucleic acid molecule which 
includes additional coding and/or non-coding sequences. 

The nucleic acid molecules of the invention may be inserted into an appropriate vector, and the 
vector may contain the necessary elements for the transcription and translation of an inserted coding 

3 5 sequence. Accordingly, vectors may be constructed which comprise a nucleic acid molecule of the 

invention, and where appropriate one or more transcription and translation elements linked to the nucleic 
acid molecule. 
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Vectors are contemplated within the scope of the invention which comprise regulatory sequences 
of the invention, as well as chimeric gene constructs wherein a regulatory sequence of the invention is 
operably linked to a heterologous nucleic acid, and a transcription termination signal. 

A vector can be used to transform host cells to express a Hzf or Hhl Polypeptide or a Hzf or Hhl 
5 Related Polypeptide, or a heterologous polypeptide (i.e. a polypeptide not naturally in the host cell). 

Therefore, the invention further provides host cells containing a vector of the invention. The invention also 
contemplates transgenic non-human mammals whose germ cells and somatic cells contain a vector 
comprising a nucleic acid molecule of the invention in particular one that encodes an analog of Hzf or Hhl, 
or a truncation of Hzf or Hhl. 
1 0 The polypeptides of the invention may be obtained as an isolate from natural cell sources, but they 

are preferably produced by recombinant procedures. In one aspect the invention provides a method for 
preparing a Hzf or Hhl Polypeptide, or a Hzf or Hhl Related Polypeptide utilizing the purified and isolated 
nucleic acid molecules of the invention. In an embodiment a method for preparing a Hzf or Hhl 
Polypeptide, or a Hzf or Hhl Related Polypeptide is provided comprising: 
1 5 (a) transferring a vector of the invention comprising a nucleic acid sequence encoding a Hzf or 

Hhl Polypeptide, or a Hzf or Hhl Related Polypeptide, into a host cell; 

(b) selecting transformed host cells from untransformed host cells; 

(c) culturing a selected transformed host cell under conditions which allow expression of the Hzf 
or Hhl Polypeptide, or a Hzf or Hhl Related Polypeptide; and 

2 0 (d) isolating the Hzf or Hhl Polypeptide, or Hzf or Hhl Related Polypeptide. 

The invention further broadly contemplates a recombinant Hzf or Hhl Polypeptide, or Hzf or Hhl 
Related Polypeptide obtained using a method of the invention. 

A Hzf or Hhl Polypeptide, or Hzf or Hhl Related Polypeptide of the invention may be conjugated 
with other molecules, such as polypeptides, to prepare fusion polypeptides or chimeric polypeptides. This 

2 5 may be accomplished, for example, by the synthesis of N-terminal or C- terminal fusion polypepi;ides. 

The invention further contemplates antibodies having specificity against an epitope of a Hzf or 
Hhl Polypeptide, or a Hzf or Hhl Related Polypeptide of the invention. Antibodies may be labeled with 
a detectable substance and used to detect polypeptides of the invention in biological samples, tissues, and 
cells. 

3 0 The invention also permits the construction of nucleotide probes that are unique to nucleic acid 

molecules of the invention and/or to polypeptides of the invention. Therefore, the invention also relates to 
a probe comprising a sequence encoding a polypeptide of the invention, or a portion (i.e. fragment) thereof 
The probe may be labeled, for example, with a detectable substance and it may be used to select from a 
mixture of nucleic acid molecules a nucleic acid molecule of the invention including nucleic acid molecules 
3 5 coding for a polypeptide which displays one or more of the properties of a polypeptide of the invention. 

In accordance with an aspect of the invention there is provided a method of, and products for 
diagnosing and monitoring conditions mediated by Hzf or Hhl by determining the presence of nucleic acid 
molecules and polypeptides of the invention. 
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Still further the invention provides a method for evaluating a test substance or compound for its 
ability to modulate the biological activity of a Hzf or Hhl Polypeptide, or a Hzf or Hhl Related Polypeptide 
of the invention. For example, a substance or compound which inhibits or enhances the activity of a Hzf 
or Hhl Polypeptide, or a Hzf or Hhl Related Polypeptide may be evaluated. "Modulate" refers to a change 
or an alteration in the biological activity of a polypeptide of the invention. Modulation may be an increase 
or a decrease in activity, a change in characteristics, or any other change in the biological, functional, or 
immunological properties of the polypeptide. 

Compounds which modulate the biological activity of a polypeptide of the invention may also be 
identified using the methods of the invention by comparing the pattern and level of expression of a nucleic 
acid molecule or polypeptide of the invention in biological samples, tissues and cells, in the presence, and 
in the absence of the compounds. 

Methods are also contemplated that identify compounds or substances (e.g. polypeptides) which 
interact with Hzf or Hhl regulatory sequences (e.g. promoter sequences, enhancer sequences, negative 
modulator sequences). 

The nucleic acid molecules, polypeptides, and substances and compounds identified using the 
methods of the invention, may be used to modulate the biological activity of a Hzf or Hhl Polypeptide, or 
a Hzf or Hhl Related Polypeptide of the invention, and they may be used in the treatment of conditions 
mediated by Hzf or Hhl such as hematopoietic disorders. Accordingly, the nucleic acid molecules, 
polypeptides, substances and compounds may be formulated into compositions for administration to 
individuals suffering from one or more of these conditions. Therefore, the present invention also relates to 
a composition comprising one or more of a polypeptide, nucleic acid molecule, or substance or compound 
identified using the methods of the invention, and a pharmaceutical ly acceptable carrier, excipient or 
diluent. A method for treating or preventing these conditions is also provided comprising administering to 
a patient in need thereof, a composition of the invention. 

The present invention in another aspect provides means necessary for production of gene-based 
therapies directed at the hematopoietic system. These therapeutic agents may take the form of 
polynucleotides comprising all or a portion of a nucleic acid molecule of the invention comprising a 
regulatory sequence of Hzf or Hhl placed in appropriate vectors or delivered to target cells in more direct 
ways. 

In accordance with a further aspect of the invention, there are provided processes for utilizing 
polypeptides or nucleic acid molecules, for in vitro purposes related to scientific research, synthesis of 
DNA and manufacture of vectors. 

These and other aspects, features, and advantages of the present invention should be apparent to 
those skilled in the art from the following drawings and detailed description. 
DESCRIPTION OF THE DRAWINGS 

The invention will be bener understood with reference to the drawings in which: 

Figure 1. UcZ Expression Pattern of Clone Hhl (A-D) and Hzf (E-F) during in vitro 
Hematopoietic Differentiation on 0P9 stromal cells, p-galactpsidase (p-gal) activity within the gene trap 
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clone Hhl was detected around day 5 within mesodermal colonies and was maintained in hematopoietic 
cells (day 12). The gene trap clone Hzf expressed low levels of P-gal activity in undifferentiated ES cells 
which was maintained within hematopoietic cells (day 2-12). Cultures were fixed with 0.25% 
gluteraldehyde and stained with X-gal after 2 days (A, E), 5 days (B, F), 8 days (C, G) and 12 days (D, H). 
5 Bar =50mm. 

Figure 2. LacZ Expression in Hhl ( A-D) and Hzf (E-H) Heterozygous Embryos. 
Wholemount lacZ expression within 9.5 d.p.c (A) Hhl heterozygous F, embryos demonstrated expression 
within the yolk sac and heart and also within the liver at 14.5 d.p.c (B). Histological analysis of the liver 
(C) and heart (D) of 14.5 d.p.c Hhl embryos. Wholemount lacZ expression within 9.5 d.p.c Hhl Fi 

10 heterozygous embryos (E) demonstrated expression within the somites, basal ganglia, apical ectodermal 

region of the limb buds and liver primordium and in the skin (Figure 2H). At 14.5 d.p.c (F) expression was 
seen in the trigeminal ganglia, thymus, salivary gland, spinal cord and spotty staining within the fetal liver. 
Histological analysis of the liver (G) and skin (H) of 14.5d.p.c Hzf embryo. Bar=100mm (A.C,E,H), 1mm 
(B, F), 500mm (D), 50mm (G). 

1 5 Figure 3. LacZ Expression within Bone Marrow Cells derived from Hhl and Hzf Heterozygous 

Adult Mice. X-gal staining of Hhl (A) and Hzf (B) bone marrow cells. LacZ expression detected by FDG 
staining (C) and cell morphology analysed by Wright-Giemsa staining (D) of the same sections of bone 
marrow cells from Hzf mice demonstrating the coincidence of /acZ expression within cells morphologically 
resembling megakaryocytes. Hhl bone marrow sections also demonstrated the same coincidence of 

2 0 expression within megakaryocytes (not shown). Bar = 50nun. 

Figure 4. LacZ Expression within Bone Marrow Cell derived Colonies. Bone marrow cells from 
Hzf and Hhl adult heterozygous mice were cultured in semi-solid agarose in the presence of TPO, IL-3 and 
SLF for the analysis of CFU-Mk. For the analysis of BFU-E, CFU-G. M, GM and GEMM bone marrow 
cells were plated in methylcellulose containing IL-3, IL-6, Epo and SLF. Colonies were either X-gal stained 

2 5 directly in the cultures or picked and then stained. High levels of X-gal staining were seen in CFU -Mk and 

CFU-GEMM for both Hzf and Hhl derived colonies. X-gal staining was seen in Hhl derived BFU-E but 
not in those derived from Hzf bone marrow cells. A low frequency of X-gal staining was detected in the 
macrophage component (depicted by the arrows) of CFU-M and GM for Hhl and only in CFU-M for Hzf. 
Photographs of cells from representative colonies are shown. 
30 Figures. 5 -RACE Sequence of and 

(5 A) Schematic diagram of the PTl/ATG gene trap vector and 5 -RACE methodology. 
(5B) The PTl/ATG vector contains a promoterless lacZ gene immediately upstream of a splice acceptor 
(SA) site and the /leo'^gene driven by the PGK-1 promoter. Random integration within an intron will 
generate a spliced fusion transcript between lacZ and the endogenous gene located at the site of vector 

3 5 integration. The fiision with lacZ enables primers to be designed (a, b and e ) to clone part of the upstream 

fused exon of the trapped gene by 5* rapid amplification of cDNA ends (RACE). The Kpnl and Pst 1 
restriction sites and primers (c & d, GTlacz-S and 4, respectively) used in the inverse PGR strategy for Hhl 
are also indicated. 
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{5C) 5 - RACE sequence of Hhi (bold). Arrows indicate sequences of the 12G10o and 12G10-6 primers 
used for 3'-RACE. (C) 5 - RACE sequence of H-f (bo\d). Arrows indicate sequences of 4H5- 1 and 4H5-2 
primers used for 3 -RACE 

Figure 6. Nucleotide Sequence and the Deduced Amino Acid Sequence of the Hhl and genes. (6A) 
5 The putative initiation codon of Hhl is located at bases 266-268. The Hhl open reading frame (ORF) 

encodes 298 amino acids. (6B) The longest ORF of W^begins with the ATG at nucleotide 29 and encodes 
396 amino acids. The polyadenylation signal is underlined and the three C2H2-type zinc finger motifs are 
boxed. The arrows depict the site of vector integration. 

Figure 7. Northern blot analysis of total RNA from wild type mouse adult tissues using probes from 
1 0 the Hhl (A) and H^ (B) genes. The GAPDH probe was used as an internal loading control (C). 

Figure 8. RT-PCR Analysis of Hzf and Hhl Expression in Normal and Trapped Mice. RNA was 
isolated from brain and amplified by RT-PCR from genotyped Hzf and Mhl mice. Amplified products were 
then southern blotted and hybridized with gene specific or lacZ probes. Amplified products were obtained 
from Hhl (A) and Hzf (B) homozygous mice using a primer 5* and another 3' to the vector integration site 
15 demonstrating that endogenous transcript 3* to the integration site had not been disrupted. Wild-tjpe mice 

were distinguished from mutant mice due to the lack of amplification of a Hrfor Hhl-lacZ fusion product 
using a primer 5' to the integration site and another complementary to the lacZ reporter gene. 
DETAILED DESCRIPTION OF THE INVENTION 

In accordance with the present invention there may be employed conventional molecular biology, 

2 0 microbiology, and recombinant DNA techniques within the skill of the art. Such techniques are explained 

fully in the literature. See for example, Sambrook, Fritsch, & Maniatis, Molecular Cloning: A Laboratory 
Manual, Second Edition (1989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y); DNA 
Cloning: A Practical Approach. Volumes I and II (D.N. Glover ed. 1985); Oligonucleotide Synthesis (M..J. 
Gait ed. 1984); Nucleic Acid Hybridization B.D. Hames & S.J. Higgins eds. (1985); Transcription and 
25 Translation B.D. Hames & S.J. Higgins eds (1984); Animal Cell Culture R.L Freshney, ed. (1986); 

Immobilized Cells and enzymes IRL Press, (1986); and B. Perbal, A Practical Guide to Molecular Cloning 
(1984). 

Nucleic Acid Molecules and Polvpeptides of the Invention 
Nucleic Acid Molecules 

^ 0 As hereinbefore mentioned, the invention provides isolated Hzf and Hhl nucleic acid molecules. 

The term "isolated" refers to a nucleic acid (or polypeptide) removed from its natural environment, purified 
or separated, or substantially free of cellular material or culture medium when produced by recombinant 
DNA techniques, or chemical reactants, or other chemicals when chemically synthesized. Preferably, an 
isolated nucleic acid is at least 60% free, more preferably at least 75% free, and most preferably at least 

3 5 90% free from other components with which they are naturally associated. The term "nucleic acid" is 

intended to include modified or unmodified DNA, RNA, including mRNAs, DNAs, cDNAs, and genomic 
DNAs, or a mixed polymer, and can be either single-stranded, double-stranded or triple-stranded. For 
example, a nucleic acid sequence may be a single-stranded or double-stranded DNA. DNA that is a mixture 
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of single-and double-stranded regions, or single-, double- and triple-stranded regions, single- and double- 
su-anded RNA, RNA that may be single-siranded, or more typically, double-stranded, or triple-stranded, 
or a mixture of regions comprising RNA or DNA, or both RNA and DNA. The strands in such regions may 
be from the same molecule or from different molecules. The DNAs or RNAs may contain one or more 
5 modified bases. For example, the DNAs or RNAs may have backbones modified for stability or fo: other 

reasons. A nucleic acid sequence includes an oligonucleotide, nucleotide, or polynucleotide. The term 
**nucleic acid molecule*' and in particular DNA or RNA, refers only to the primar\' and secondary structure 
and it does not limit it to any particular tertiary forms. 

In an embodiment of the invention an isolated nucleic acid molecule is contemplated which 
1 0 comprises: 

(i) a nucleic acid sequence encoding a polypeptide having substantial sequence identity 
with the amino acid sequence of SEQ. ID. NO. 2, or 4; 

(ii) a nucleic acid sequence complementary to (i); 

(iii) a nucleic acid sequence differing from any of (i) or (ii) in codon sequences due to 
15 the degeneracy of the genetic code; 

(iv) a nucleic acid sequence comprising at least 10, 15, 18, and preferably at least 20 
nucleotides capable of hybridizing to a nucleic acid sequence of SEQ. ID. NO. 1 or 
3 or to a degenerate form thereof; 

(v) a nucleic acid sequence encoding a truncation, an analog, an allelic or species 
2 0 variation of a polypeptide comprising the amino acid sequence of SEQ. II). NO. 2 

or 4; or 

(vi) a fragment, or allelic or species variation of (i), (ii) or (iii) 

In a specific embodiment, the isolated nucleic acid molecule comprises: 

(i) a nucleic acid sequence having substantial sequence identity or sequence 

2 5 similarity with a nucleic acid sequence of SEQ. ID. NO. 1 or 3; 

(ii) nucleic acid sequences complementary to (i), preferably complementary 
to the full nucleic acid sequence of SEQ. ID. NO. 1 or 3; 

(iii) nucleic acid sequences differing from any of the nucleic acid sequences of 
(i) or (ii) in codon sequences due to the degeneracy of the genetic code; or 

3 0 (iv) a fragment, or allelic or species variation of (i), (ii) or (iii). 

The term "complementary'' refers to the natural binding of nucleic acid molecules under 
permissive salt and temperature conditions by base-pairing. For example, the sequence " A-G-T" binds to 
the complementao' sequence 'T-C-A" .Complementarity between two single-stranded molecules may be 
"partial", in which only some of the nucleic acids bind, or it may be complete when total complementarity 
3 5 exists between the single stranded molecules. 

In a preferred embodiment the isolated nucleic acid comprises a nucleic acid sequence encoded 
by the amino acid sequence of SEQ. ID. NO. 2 or 4, or comprises the nucleic acid sequence of SEQ. ID. 
NO. i or 3 wherein T can also be U. 
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The terms "sequence similarity" or "sequence identity*' refer to the relationship between two or 
more amino acid or nucleic acid sequences^ determined by comparing the sequences, which relationship 
is generally known as **homology". Identity in the art also means the degree of sequence relatedness 
between amino acid or nucleic acid sequences, as the case may be, as determined by the match between 
5 strings of such sequences. Both identity and similarity can be readily calculated (Computational Molecular 

Biology, Lesk, A.M.. ed., Oxford University Press New York, 1988; Biocomputing: Informatics and 
Genome Projects, Smith, D.W. ed.. Academic Press, New York, 1993; Computer Analysis of Sequence 
Data, Part I, Griffin, A.M., and Griffin, H.G. eds. Humana Press, New Jersey, 1994; Sequence Analysis 
in Molecular Biology, von Heinje, G., Academic Press, New York, 1987; and Sequence Analysis Primer, 

10 Gribskov, M. and Devereux. J., eds. M. Stockton Press, New York, 1991). While there are a number of 

existing methods to measure identity and similarity between two amino acid sequences or two nucleic acid 
sequences, both terms are well known to the skilled artisan (Sequence Analysis in Molecular Biology, von 
Heinje, G., Academic Press, New York, 1987; Sequence Analysis Primer, Gribskov, M. and Devereux, J., 
eds. M. Stockton Press, New York, 1991; and Carillo, H., and Lipman, D. SIAM J. Applied Math., 

15 48: 1073. 1988). Preferred methods for determining identity are designed to give the largest match between 

the sequences tested. Methods to determine identity are codified in computer programs. Preferred computer 
program methods for determining identity and similarity between two sequences include but are not limited 
to the GCG program package (Devereux, J. et al. Nucleic Acids Research 12(1): 387. 1984), BLASTP, 
BLASTN, and FASTA (Atschul, S.R et al., J. Molec. Biol. 215:403, 1990). Identity or similarity may also 

20 be determined using the alignment algorithm of Dayhoff et al [Methods in Enzymology 91: :524-545 

(1983)]. 

Preferably, the nucleic acids of the present invention have substantial sequence identity using the 
preferred computer programs cited herein, for example greater than 50% , 60%, 70%, 80%, or 90% 
sequence identity, and preferably at least 90% , 95%, 96%, 97%, 98%, or 99% sequence identity to the 

25 sequence shown in SEQ. ID. NO. 1 or 3. 

Isolated nucleic acids encoding a Hzf or Hhl Polypeptide and comprising a sequence that differs 
from the nucleic acid sequence shown in SEQ. ID. NO. 1 or 3 due to degeneracy in the genetic code are 
also within the scope of the invention. Such nucleic acids encode equivalent polypeptides but differ in 
sequence from the sequence of SEQ. ID. NO. 1 or 3 due to degeneracy in the genetic code. As one example, 

3 0 DNA sequence polymorphisms within hrfor hhl may result in silent mutations that do not affect th.e amino 

acid sequence. Variations in one or more nucleotides may exist among individuals within a population due 
to natural allelic variation. Any and all such nucleic acid variations are within the scope of the invention. 
DNA sequence polymorphisms may also occur which lead to changes in the amino acid sequence of Hzf 
or Hhl Polypeptide. These amino acid polymorphisms are also within the scope of the present invention. 

35 In addition, species variations i.e. variations in nucleotide sequence naturally occurring among different 

species, are within the scope of the invention. 

Another aspect of die invention provides a nucleic acid molecule which hybridizes under selective 
conditions, (e.g. high stringency conditions), to a nucleic acid which comprises a sequence which encodes 
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a Hzf or Hhl Polypeptide of the invention. Preferably the sequence encodes the amino acid sequence shown 
in SEQ. ID. NO. 2 or 4 and comprises at least iO, preferably at least 15, more preferably at least 18, and 
most preferably at least 20 nucleotides. Selectivity of hybridization occurs with a certain degree of 
specificity rather than being random. Appropriate stringency conditions which promote DNA hybridization 
5 are known to those skilled in the art, or can be found in Current Protocols in Molecular Biology, John 

Wiley & Sons. N.Y. (1989), 6.3.1-6.3.6. For example, hybridization may occur at 30^C in 750 mM NaCI, 
75mM uisodium citrate, and 1% SDS, preferably 37°C in 500mM NaCI, 500 mM trisodium citrate, \% 
SDS, 35% formamide. and lOO^g/ml denatured salmon sperm DNA (ssDNA), and more preferably 42*'C 
in 250 mM NaCl, 25 mM trisodium citrate, 1% SDS, 50% formamide, and 200 fig/ml ssDNA. Useful 

1 0 variations on these conditions will be readily apparent to those skilled in the art. 

The stringency may be selected based on the conditions used in the wash step. Wash step 
stringency conditions may be defined by salt concentration and by temperature. Generally, wash stringency 
can be increased by decreasing salt concentration or by increasing termperature. By way of example, a 
stringent salt concentration for the wash step is preferably less than about 30 mM NaCl and 3mM trisodium 

15 citrate, and more preferably less than about 15 mM NaCl and 1.5 mM trisodium citrate. Stringent 

temperature conditions will generally include temperatures of a least about 25°C, more preferably at least 
about 68°C. In a preferred embodiment, the wash steps will be carried out at 42°C in 15 mM NaCl. 1.5mM 
trisodium citrate, and 0. 1% SDS. In a more preferred embodiment the wash steps are carried out at 68°C 
in 15 mM NaCl, 1.5mM trisodium citrate, and 0.1% SDS. Variations on these conditions will be readily 

2 0 apparent to those skilled in the art. 

It will be appreciated that the invention includes nucleic acid molecules encoding a Hzf or Hhl 
Polypeptide, or a Hzf or Hhl Related Polypeptide, including truncations of the polypeptides, allelic and 
species variants, and analogs of the polypeptides as described herein. In particular, fragments of a nucleic 
acid of the invention are contemplated that are a stretch of at least about 10, 15, or 18, and most preferably 

2 5 at least 20 nucleotides, more typically at least 50 to 200 nucleotides but less than 2 kb. It will further be 

appreciated that variant forms of the nucleic acid molecules of the invention which arise by alternative 
splicing of an mRNA corresponding to a cDNA of the invention are encompassed by the invention. 

An isolated nucleic acid molecule of the invention which comprises DNA can be isolated by 
preparing a labeled nucleic acid probe based on all or part of the nucleic acid sequence shown in SEQ. ID. 

3 0 NO. 1 or 3. The labeled nucleic acid probe is used to screen an appropriate DNA library (e.g. a cDNA or 

genomic DNA library). For example, a cDNA library can be used to isolate a cDNA encoding a Hzf or Hhl 
Polypeptide, or a Hzf or Hhl Related Polypeptide by screening the library with the labeled probe using 
standard techniques. Alternatively, a genomic DNA library can be similarly screened to isolate a genomic 
clone encompassing a hrfor hhl gene. Nucleic acids isolated by screening of a cDNA or genomic DNA 
3 5 library can be sequenced by standard techniques. 

An isolated nucleic acid molecule of the invention that is DNA can also be isolated by selectively 
amplifying a nucleic acid of the invention. "Amplifying" or "amplification " refers to the production of 
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additional copies of a nucleic acid sequence and is generally carried out using polymerase chain reaction 
(PCR) technologies well known in the art (Dieffenbach, C. W, and G. S. Dveksler (1995) PGR Primer, a 
Laboratory Manual, Cold Spring Harbor Press, Plainview. N.Y.). In particular, it is possible to design 
synthetic oligonucleotide primers from the nucleotide sequence shown in SEQ. ID. NO. i or (e.g. SEQ. ID. 
5 Nos. 5-14) for use in PCR. A nucleic acid can be amplified from cDNA or genomic DNA usinj; these 

oligonucleotide primers and standard PCR amplification techniques. The nucleic acid so amplified can be 
cloned into an appropriate vector and characterized by DNA sequence analysis. cDNA may be prepared 
from mRNA, by isolating total cellular mRNA by a variety of techniques, for example, by using the 
guanidinium-thiocyanate extraction procedure of Chirgwin et al. Biochemistry, 18. 5294-5299 (1979). 

10 cDNA is then synthesized from the mRNA using reverse transcriptase (for example, Moloney MLV 

reverse transcriptase available from Gibco/BRL, Bethesda, MD, or AMV reverse transcriptase available 
firom Seikagaku America, Inc.. St. Petersburg, FL). 

An isolated nucleic acid molecule of the invention which is RNA can be isolated by cloning a 
cDNA encoding a Hzf or Hhl Polypeptide, or a Hzf or Hhl Related Polypeptide into an appropriate vector 

15 which allows for transcription of the cDNA to produce an RNA molecule which encodes a Hzf or Hhl 

Polypeptide, or a Hzf or Hhl Related Polypeptide. For example, a cDNA can be cloned downsU-eam of a 
bacteriophage promoter, (e.g. a T7 promoter) in a vector, cDNA can be transcribed in vitro with T7 
polymerase, and the resultant RNA can be isolated by conventional techniques. 

A nucleic acid molecule of the invention may be engineered using methods known in the art to 

2 0 alter the Hzf or Hhl encoding sequence for a variety of purposes including modification of the cloning, 

processing, and/or expression of the gene product. Procedures such as DNA shuffling by random 
fragmentation and PCR reassembly of gene fragments and synthetic oligonucleotides may be used to 
engineer the nucleic acid molecules. Mutations may be introduced by oligonucleotide-mediated site- 
directed mutagenesis to create for example new restriction sites, alter glycosylation patterns, change codon 

2 5 preference, or produce splice variants. 

Nucleic acid molecules of the invention may be chemically synthesized using standard techniques. 
Methods of chemically synthesizing polydeoxynucleotides are known, including but not limited to solid- 
phase synthesis which, like peptide synthesis, has been fully automated in commercially available DNA 
synthesizers (See e.g., Itakura et al. U.S. Patent No. 4,598.049; Caruthers et al. U.S. Patent No. 4.458.066; 

3 0 and Itakura U.S. Patent Nos. 4,401,796 and 4,373.071). 

Determination of whether a particular nucleic acid molecule is a hsfor hhl gene or encodes a Hzf 
or Hhl Polypeptide, or a Hzf or Hhl Related Polypeptide can be accomplished by expressing the cDNA in 
an appropriate host cell by standard techniques, and testing the expressed polypeptide in the methods 
described herein. A Hrf or Hhl cDNA or cDNA encoding a Hzf or Hhl Polypeptide, or a Hzf or Hhl 
3 5 Related Polypeptide can be sequenced by standard techniques, such as dideoxynucleotide chain termination 

or Maxam-Gilbert chemical sequencing, to determine the nucleic acid sequence and the predicted amino 
acid sequence of the encoded polypeptide. 
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The nucleic acid molecules of the invention may be extended using a partial nucleotide sequence 
and various PCR-based methods known in the art to detect uptsiream sequences such as promoters and 
regulatory elements. For example, restriction-site PGR which uses universal and nested primers to amplify 
unknown sequences from genomic DNA within a cloning vector may be employed (See Sarkar. G, PGR 
5 Methods Applic. 2:318-322, 1993). Inverse PGR which uses primers that extend in divergent directions 

to amplify unknown sequences from a circularized template may also be used. The template in inverse PGR 
is derived from restriction fragments adjacent to known sequences in human and yeast artificial 
chromosome DNA (See e.g. Lagersirom, M., at al, PGR Methods Applic. 1:111-119, 1991). Other methods 
for retrieving unknown sequences are known in the art (e.g. Parker, J.D. et al. Nucleic Acids Res. 19:305- 

1 0 306» 1991). In addition, PGR, nested primers, and PROMOTERFINDER libraries (Glontech. Palo Alto. 

Galifomia) may be used to walk genomic DNA. The method is useful in finding intron/exon junctions and 
avoids the need to screen libraries. 

It is preferable when screening for full-length cDNAs to use libraries that have been size-selected 
to include larger cDNAs. For situations in which an oligo d(T) library does not yield a full-length cDNA. 

15 it is preferable to use random-primed libraries which often include sequences containing the 5' regions of 

genes. Genomic libraries may be useful for extending the sequence into 5 'non-translated regulatory regions. 

Gommercially available capillary electrophoresis systems may be employed to analyse the size 
or confirm the sequence of PGR or sequencing products. The system may use flowable polymers for 
electrophoretic separation, four different nucleotide-specific, laser-stimulated fluorescent dyes, and a charge 

2 0 coupled device camera for detection of the emitted wavelengths. Gonmiercially available software (e.g. 

GENOTYPER and SEQUENGE NAVIGATOR, Perkin-EImer) may convert the output/light intensity to 
electrical signal, and the entire process from loading of samples, and computer analysis and elecu-onic data 
display may be computer controlled. This procedure may be particularly useful for sequencing small DNA 
fragments which may be present in limited amounts in a particular sample. 

2 5 In accordance with one aspect of the invention, a nucleic acid is provided comprising a h^or hhl 

regulatory sequence such as a promoter sequence. 

The invention contemplates nucleic acid molecules comprising all or a portion of a nucleic acid 
molecule of the invention comprising a regulatory sequence of a Hzf or Hhl contained in appropriate 
vectors. The vectors may contain heterologous nucleic acid sequences. ''Heterologous nucleic acid" refers 

3 0 to a nucleic acid not naturally located in the cell, or in a chromosomal site of the cell. Preferably, the 

heterologous nucleic acid includes a nucleic acid foreign to the cell. 

In accordance with anodier aspect of the invention, the nucleic acid molecules isolated using the 
methods described herein are mutant hzf or hhl gene alleles. For example, the mutant alleles may be isolated 
from individuals either known or proposed to have a genotype which contributes to the symptoms of a 
3 5 hematopoietic disorder. Mutant alleles and mutant allele products may be used in therapeutic and diagnostic 

methods described herein. For example, a cDNA of a mutant Hzfor Hhl gene may be isolated using PGR 
as described herein, and the DNA sequence of the muunt allele may be compared to the normal allele to 
ascertain the mutation(s) responsible for the loss or alteration of function of the mutant gene product. A 
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genomic library can also be constructed using DNA from an individual suspected of or known to carry a 
mutant allele, or a cDNA library can be consuiicted using RNA from tissue known, or suspected to express 
the muunt allele. A nucleic acid encoding a normal H:f or Hhl gene or any suitable fragment thereof, may 
then be labeled and used as a probe to identify the corresponding mutant allele in such libraries. Clones 
5 containing mutant sequences can be purified and subjected to sequence analysis. In addition, an expression 

library can be constructed using cDNA from RNA isolated from a tissue of an individual known or 
suspected to express a mutant /z^or hlil allele. Gene products from putatively mutant tissue may be 
expressed and screened, for example using antibodies specific for a Hzf or Hhl Polypeptide, or a Hzf or 
Hhl Related Polypeptide as described herein. Library clones identified using the antibodies can be purified 

1 0 and subjected to sequence analysis. 

Antisense molecules and ribozymes are contemplated within the scope of the invention. They may 
be prepared by any method known in the art for the synthesis of nucleic acid molecules. These include 
techniques for chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical 
synthesis. Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA 

1 5 sequences encoding Hzf or Hhl Polypeptide. Such DNA sequences may be incorporated into a wide variety 

of vectors with suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA 
constructs that synthesize antisense RNA constitutively or inducibly can be introduced into cell lines, cells, 
or tissues. RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5 ' and/or 3 ' ends of 

2 0 the molecule or the use of phosphorothioate or 2' O-methyl rather dian phosphodiesterase linkage s within 

the backbone of the molecule. This concept is inherent in the production of FN As and can be extended in 

all of these molecules by the inclusion of nontraditional bases such as inosine, queosine, and wybutosine, 

as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, guanine, thymine, and 

uridine which are not as easily recognized by endogenous endonucleases. 
25 Polypeptides of the Invention 

The polypeptides of the invention are primarily expressed in hematopoietic lineages. Hzf contains 

three C2H2-type zinc fingers and it is primarily expressed in megakaryocytes and multipotential progenitors. 

Hhl is expressed in most myeloid lineages with particularly high expression in multipotential progenitors, 

erythroid, and megakaryocytic cells. 
30 Amino acid sequences of polypeptides of the invention comprise the sequences of SEQ. ID. 

N0.2., or SEQ, ID. NO. 4. In addition to the amino acid sequences as shown SEQ. ID, N0.2, or SEQ. ID. 

NO. 4, the polypeptides of the present invention include truncations of the polypeptides of the invention, 

and analogs, and homologs of the polypeptides and truncations thereof as described herein. 

Truncated polypeptides may comprise peptides having an amino acid sequence of at least five 

3 5 consecutive amino acids in SEQ.ID. NO. 2 or 4 where no amino acid sequence of five or more, six or more, 

seven or more, or eight or more, consecutive amino acids present in the fragment is present in a polypeptide 
other than a Hzf or Hhl Polypeptide. In an embodiment of the invention the fragment is a stretch of amino 
acid residues of at least 10 to 100, preferably at least 10 to 75, more preferably 10 to 50, and most 
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preferably 12 to 20 contiguous amino acids from particular sequences such as the sequences shown in 
SEQ.ID. NO. 2 or 4. The fragments may be immunogenic and preferably are not immunoreaciive with 
antibodies that are immunoreactive to polypeptides other than Hzf or Hhl. 

The truncated polypeptides may have an amino group (-NH2), a hydrophobic group (for example, 
5 carbobenzoxyl, dansyl, or T-butyloxycarbonyl). an acetyl group, a 9-fluorenylmethoxy-carbonyl (PMOC) 

group, or a macromolecuie including but not limited to lipid-fatty acid conjugates, polyethylene glycol, or 
carbohydrates at the amino terminal end. The truncated polypeptides may have a carboxyl group, an amido 
group, a T-butyloxycarbonyl group, or a macromolecuie including but not limited to lipid-fatty acid 
conjugates, polyethylene glycol, or carbohydrates at the carboxy terminal end. 

1 0 The polypeptides of the invention may also include analogs, and/or truncations thereof as 

described herein, which may include, but are not limited to the polypeptides, containing one or more amino 
acid substitutions, insertions, and/or deletions. Amino acid substitutions may be of a conserved or non- 
conserved nature. Conserved amino acid substitutions involve replacing one or more amino acids with 
amino acids of similar charge, size, and/or hydrophob icily characteristics. When only conserved 

15 substitutions are made the resulting analog should be functionally equivalent to the native polypeptide. 

Non-conserved substitutions involve replacing one or more amino acids with one or more amino acids 
which possess dissimilar charge, size, and/or hydrophobic ity characteristics. 

One or more amino acid insertions may be introduced into a polypeptide of the invention. Amino 
acid insertions may consist of single amino acid residues or sequential amino acids ranging from about 2 

20 to 15 amino acids in length. 

Deletions may consist of the removal of one or more amino acids, or discrete portions from the 
polypeptide sequence. The deleted amino acids may or may not be contiguous. The lower limit length of 
the resulting analog with a deletion mutation is about 10 amino acids, preferably 100 amino acids. 

An allelic variant at the polypeptide level differs from another polypeptide by only one, or at most, 

25 a few amino acid substitutions. A species variation of a polypeptide of the invention is a variation which 

is naturally occurring among different species of an organism. 

The polypeptides of the invention also include homologs and/or truncations thereof as described 
herein. Such homologs include polypeptides whose amino acid sequences are comprised of the amino acid 
sequences of regions from odier species that hybridize under selective hybridization conditions (see 

3 0 discussion of selective and in particular stringent hybridization conditions herein) with a probe used to 

obtain a polypeptide of the invention. These homologs will generally have the same regions which are 
characteristic of a polypeptide of the invention. It is anticipated that a polypeptide comprising an amino 
acid sequence which is at least 75% identity or at least 80% similarity, preferably 80 to 90% identity or 
90% similarity, more preferably 90 to 95% identity or 95% similarity, and most preferably 95 to 99% 

3 5 identity or 99% similarity with an amino acid sequence of SEQ. ID. N0.2 or SEQ. ID. NO 4 will be a 

homolog. A percent amino acid sequence similarity or identity is calculated using the methods described 
herein, preferably the computer programs described herein. For example, a percent amino acid sequence 
homology or identity is calculated as the percentage of aligned amino acids that match the reference 
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sequence, where the sequence alignment has been determined using the alignment algorithm of Dayhoff 
et al; Methods in Enzymology 91: 524-545 (1983). 

The invention also contemplates isoforms of the polypeptides of the invention. An isoform 
contains the same number and kinds of amino acids as the polypeptide of the invention, but the isoform has 
5 a different molecular structure. The isoforms contemplated by the present invention are those having the 

same properties as a polypeptide of the invention as described herein. 

The present invention also includes polypeptides of the invention conjugated with a selected 
polypeptide, or a marker polypeptide (see below) to produce fusion polypeptides. Additionally, 
immunogenic portions of a polypeptide of the invention are within the scope of the invention. 
10 A polypeptide of the invention may be prepared using recombinant DNA methods. Accordingly, 

the nucleic acid molecules of the present invention having a sequence which encodes a polypeptide of the 
invention may be incorporated in a known manner into an appropriate expression vector which ensures 
good expression of the polypeptide. Possible expression vectors include but are not limited to cosmids, 
plasmids, or modified viruses (e.g. replication defective retroviruses, adenoviruses and adeno-associated 
15 viruses), so long as the vector is compatible with the host cell used. 

The invention therefore contemplates a vector of the invention containing a nucleic acid molecule 
of the invention, and optionally the necessary regulatory sequences for the transcription and translation of 
die inserted polypeptide-sequence. Suitable regulatory sequences may be derived from a variety of sources, 
including bacterial, fungal, viral, mammalian, or insect genes (For example, see the regulatory sequences 

2 0 described in Goeddel, Gene Expression Technology: Methods in Enzymology 185, Academic Press, San 

Diego, CA (1990). Selection of appropriate regulatory sequences is dependent on the host cell chosen as 
discussed below, and may be readily accomplished by one of ordinary skill in the art. The necessary 
regulatory sequences may be supplied by a native polypeptide and/or its flanking regions. 

The invention further provides a vector comprising a DNA nucleic acid molecule of the invention 

25 cloned into the vector in an antisense orientation. That is, the DNA molecule is linked to a regulatory 

sequence in a manner which allows for expression, by transcription of the DNA molecule, of an RNA 
molecule which is antisense to a nucleic acid sequence of a nucleic acid molecule of the invention. 
Regulatory sequences linked to the antisense nucleic acid can be chosen which direct the continuous 
expression of the antisense RNA molecule in a variety of cell types, for instance a viral promoter and/or 

30 enhancer, or regulatory sequences can be chosen which direct tissue or cell type specific expression of 

antisense RNA. 

The expression vector of the invention may also contain a marker gene which facilitates the 
selection of host cells transformed or transfected with a vector of the invention. Examples of marker genes 
are genes encoding a polypeptide such as 04 18 and hygromycin which confer resistance to certain drugs, 

3 5 P-galactosidase, chloramphenicol acetyltransferase, firefly luciferase, or an immunoglobulin or portion 

thereof such as the Fc portion of an immunoglobulin preferably IgG. The markers can be introduced on a 
separate vector from the nucleic acid of interest. 

The vectors may also contain genes which encode a fusion moiety which provides increased 
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expression of the recombinant polypeptide; increased solubility of the recombinant polypeptide; and aid 
in the purification of the target recombinant polypeptide by acting as a ligand in affinity purification. For 
example, a proteolytic cleavage site may be added to the target recombinant polypeptide to allow separation 
of the recombinant polypeptide from the fusion moiety subsequent to purification of the fusion polypeptide. 
Typical fusion expression vectors include pGEX (Amrad Corp., Melbourne, Australia), pMAL (New 
England Biolabs, Beverly, MA) and pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S- 
transferase (GST), maltose E binding polypeptide, or polypeptide A, respectively, to the recombinant 
polypeptide. 

The vectors may be introduced into host cells to produce a transformed or transfected host cell. 
The terms "transfected and "transfeciion" encompass the introduction of nucleic acid (e.g. a vector) into 
a cell by one of many standard techniques. A cell is "transformed" by a nucleic acid when the transfected 
nucleic acid effects a phenotypic change. Prokaryotic cells can be transfected or transformed with nucleic 
acid by, for example, electroporation or calcium-chloride mediated transformation. Nucleic acid can be 
introduced into mammalian cells via conventional techniques such as calcium phosphate or calcium 
chloride co-precipitation. DEAE-dextran-mediated transfection, lipofectin, electroporation or 
microinjection. Suitable methods for transforming and transfecting host cells can be found in Sambrook 
et al. (Molecular Cloning: A Laboratory Manual, 2nd Edition, Cold Spring Harbor Uboratory press 
(1989)), and other laboratory textbooks. 

Suitable host cells include a wide variety of prokaryotic and eukaryotic host cells. For example, 
the polypeptides of the invention may be expressed in bacterial cells such as £. coli, insect cells (using 
baculovirus), yeast cells, or mammalian cells. Other suitable host cells can be found in Goeddel, Gene 
Expression Technology: Methods inEnzymology 185, Academic Press, San Diego. CA (199 1). 

A host cell may also be chosen which modulates the expression of an inserted nucleic acid 
sequence, or modifies (e.g. glycosylation or phosphorylation) and processes (e.g. cleaves) the polypeptide 
in a desired fashion. Host systems or cell lines may be selected which have specific and characteristic 
mechanisms for post-translational processing and modification of polypeptides. For example, eukaryotic 
host cells including CHO, VERO, BHK, HeLA. COS, MDCK, 293, 3T3, and WI38 may be used. For long- 
term high-yield stable expression of the polypeptide, cell lines and host systems which stably express the 
gene product may be engineered. 

Host cells and in particular cell lines produced using the methods described herein may be 
particularly useful in screening and evaluating substances or compounds that modulate the activity of a 
polypeptide of the invention. 

The polypeptides of the invention may also be expressed in non-human transgenic animals 
including but not limited to mice. rats, rabbits, guinea pigs, micro-pigs, goats, sheep, pigs, non-human 
primates (e.g. baboons, monkeys, and chimpanzees) (see Hammer et al. (Nature 315:680-683, 1985), 
Palmiier et al (Science 222:809-814, 1983), Brinster et al. (Proc Natl. Acad. Sci USA 82:44384442, 1985), 
Palmiter and Brinster (Cell. 41:343-345, 1985) and U.S. Patent No. 4.736,866). Procedures known in the 
art may be used to introduce a nucleic acid molecule of the invention encoding a polypeptide of the 
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invention into animals to produce the founder lines of transgenic animals. Such procedures include 
pro nuclear microinjection, retrovirus mediated gene transfer into germ lines, gene targeting in embryonic 
stem cells, electroporation of embryos, and sperm-mediated gene transfer. 

The present invention contemplates a transgenic animal that carries a nucleic acid molecule of the 
5 invention in all their cells, and animals which carry the transgene in some bui not all their cells. The 

transgene may be integrated as a single transgene or in concatamers. The transgene may be selectively 
introduced into and activated in specific cell types (See for example, Lasko et al, 1992 Proc. Natl. Acad. 
Sci. USA 89: 6236). The transgene may be integrated into the chromosomal site of the endogenous gene 
by gene targeting. The transgene may be selectively introduced into a particular cell type inactivating the 

1 0 endogenous gene in that cell type (See Gu ct al Science 265: 103-106). 

The expression of a recombinarit polypeptide of the invention in a transgenic animal may be 
assayed using standard techniques. Initial screening may be conducted by Southern Blot analysis, or PGR 
methods to analyze whether the u-ansgene has been integrated. The level of mRNA expression in the tissues 
of transgenic animals may also be assessed using techniques including Northern blot analysis of tissue 

15 samples, in situ hybridization, and RT-PCR. Tissue may also be evaluated immunocytochemically using 

antibodies against Hzf or Hhl Polypeptide. 

The polypeptides of the invention may also be prepared by chemical synthesis using techniques 
well known in the chemistry of polypeptides such as solid phase synthesis (Merrifield, 1964. J. Am. Chem. 
Assoc. 85:2149-2154), or synthesis in homogenous solution (Houbenweyl, 1987. Methods of Organic 

20 Chemistry, ed. E. Wansch, Vol. 15 I and II. Thieme. Stuttgart). 

N-terminal or C-terminal fusion or chimeric polypeptides comprising a polypeptide of the 
invention conjugated with other molecules, such as polypeptides m (e.g. markers) may be prepared by 
fusing, through recombinant techniques, the N-terminal or C-terminal of a polypeptide of the invention, and 
the sequence of a selected polypeptide or marker polypeptide with a desired biological function. The 

2 5 resultant fusion polypeptides contain a polypeptide of the invention fused to the selected polypeptide or 

marker polypeptide as described herein. Examples of polypeptides which may be used to prepare fusion 
polypeptides include immunoglobulins. glutathione-S-transferase (GST), hemagglutinin (HA), and 
truncated myc. 
Antibodies 

30 A polypeptide of the invention can be used to prepare antibodies specific for the polypeptides. 

Antibodies can be prepared which bind a distinct epitope in an unconsented region of the polypeptide. An 
unconserved region of the polypeptide is one that does not have substantial sequence homology to other 
polypeptides. A region from a conserved region such as a well-characterized sequence can also be used to 
prepare an antibody to a conserved region of a polypeptide of the invention. Antibodies having specificity 

35 for a polypeptide of the invention may also be raised from fusion polypeptides created by expressing fusion 

polypeptides in host cells as described herein. 

The invention can employ intact monoclonal or polyclonal antibodies, and immunologically active 
fragments (e.g. a Fab or (Fab)2 fragment), an antibody heavy chain, and antibody light chain, a genetically 
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engineered single chain Fy molecule (Ladner et al, U.S. Pai. No. 4,946,778), or a chimeric antibody, for 
example, an antibody which contains the binding specificity of a murine antibody, but in which the 
remaining portions are of human origin. Antibodies including monoclonal and polyclonal antibodies, 
fragments and chimeras, may be prepared using methods known to those skilled in the art. 
5 Applications of the Nucleic Acid Molecules. Polypeptides, and Antibodies of the Invention 

The nucleic acid molecules. Hzf or Hhl Polypeptide, or a Hzf or Hhl Related Polypeptide, and 
antibodies of the invention may be used in the prognostic and diagnostic evaluation of conditions requiring 
modulation of a nucleic acid or polypeptide of the invention, including disorders of the hematopoietic 
system, and the identification of subjects with a predisposition to such conditions (See below). Methods 

10 for detecting nucleic acid molecules and polypeptides of the invention can be used to monitor such 

conditions by detecting and localizing the nucleic acids and polypeptides. It would also be apparent to one 
skilled in the art that the methods described herein may be used to study the developmental expression of 
the polypeptides of the invention and, accordingly, will provide further insight into the role of the 
polypeptides. The applications of the present invention also include methods for the identification of 

15 substances or compounds that modulate the biological activity of a polypeptide of the invention (See 

below). The substances, compounds, antibodies etc may be used for the treatment of conditions requiring 
modulation of polypeptides of the invention including hematopoietic disorders. (See below). 
Diagnostic Methods 

A variety of methods can be employed for the diagnostic and prognostic evaluation of conditions 

2 0 requiring modulation of a nucleic acid or polypeptide of the invention (e.g. hematopoietic disord<;rs). and 

the identification of subjects with a predisposition to such conditions. Such methods may, for example, 
utilize nucleic acid molecules of the invention, and fragments thereof, and antibodies directed against 
polypeptides of the invention, including peptide fragments. In particular, the nucleic acids and antibodies 
may be used, for example, for: (1) the detection of the presence of Hzf or Hhl mutations, or the detection 

25 of either over- or under-expression of hzf or hhl mRNA relative to a non-disorder state or the qualitative 

or quantitative detection of alternatively spliced forms oihifor hhl transcripts which may correlate with 
certain conditions or susceptibility toward such conditions; or (2) the detection of either an over- or an 
under-abundance of a polypeptide of the invention relative to a non-disorder state or the presence of a 
modified (e.g., less than full length) polypeptide of the invention which correlates with a disorder state, or 

30 a progression toward a disorder state. 

The methods described herein may be performed by utilizing pre-packaged diagnostic kits 
comprising at least one specific nucleic acid or antibody described herein, which may be conveniendy used, 
e.g.. in clinical settings, to screen and diagnose patients and to screen and identify those individuals 
exhibiting a predisposition to developing a disorder. 

3 5 Nucleic acid-based detection techniques and peptide detection techniques are describe*! below. 

The samples that may be analyzed using the methods of the invention include those which are known or 
suspected to express hzf or hhl or contain a polypeptide of the invention. The methods may be performed 
on biological samples including but not limited to cells, lysates of cells which have been incubated in cell 
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culture, chromosomes isolated from a eel! (e.g. a spread of metaphase chromosomes), genomic DNA (in 
solutions or bound to a solid support such as for Southern analysis), RNA (in solution or bound to a solid 
support such as for northern analysis), cDNA (in solution or bound to a solid support), an extract from cells 
or a tissue, and biological fluids such as serum, urine, blood, and CSF. The samples may be derived from 
5 a patient or a culture. 

The polypeptides of the invention are primarily expressed in hematopoietic lineages, and in 
particular in megakaryocytes. The polypeptides of the invention have a role in proliferation, differentiation, 
activation and/or metabolism of cells of the hematopoietic lineages. Therefore, the methods described 
herein for detecting nucleic acid molecules and polypeptides can be used to monitor proliferation, 

1 0 differentiation, activation and/or metabolism of cells of the hematopoietic lineage (e.g. megakaryocytes) 

by detecting and localizing polypeptides and nucleic acid molecules of the invention. The methods 
described herein may be used to study the developmental expression of a polypeptide of the invention and, 
accordingly, will provide further insight into the role of the polypeptide in the hematopoietic system. 

The nucleic acid molecules and polypeptides of the invention are markers for hematopoietic cells 

15 and accordingly the antibodies and probes described herein may be used to label these cells. 

Methods for Detecting Nucleic Acid Molecules of the Invention 

The nucleic acid molecules of the invention allow those skilled in the art to construct nucleotide 
probes for use in the detection of nucleic acid sequences of the invention in biological materials. Suitable 
probes include nucleic acid molecules based on nucleic acid sequences encoding at least 5 sequential amino 

2 0 acids from regions of the Hzf or Hhl Polypeptide, or a Hzf or Hhl Related Polypeptide (see SEQ. ID. No. 

I or 3), preferably they comprise 15 to 30 nucleotides. Examples of probes are shown in SEQ. ID. NOs. 
5 to 14. 

A nucleotide probe may be labeled with a detectable substance such as a radioactive label that 
provides for an adequate signal and has sufficient half-life such as ^^p^ 3h^ 14c or the like. Other 

2 5 detectable substances that may be used include antigens that are recognized by a specific labeled antibody, 

fluorescent compounds, enzymes, antibodies specific for a labeled antigen, and luminescent compounds. 
An appropriate label may be selected having regard to the rate of hybridization and binding of the probe 
to the nucleotide to be detected and the amount of nucleotide available for hybridization. Labeled probes 
may be hybridized to nucleic acids on solid supports such as nitrocellulose filters or nylon membranes as 

3 0 generally described in Sambrook et al» 1989, Molecular Cloning, A Laboratory Manual (2nd ed.). The 

nucleic acid probes may be used to detect Hzf or Hhl genes, preferably in human cells. The nucleotide 
probes may also be useful for example in the diagnosis or prognosis of conditions such as hematopoietic 
disorders, and in monitoring the progression of these conditions, or monitoring a therapeutic treatment. 

The probe may be used in hybridization techniques to detect a hzf or hhl gene. The technique 
3 5 generally involves contacting and incubating nucleic acids (e.g. recombinant DNA molecules, cloned 

genes) obtained from a sample from a patient or other cellular source with a probe of the present invention 
under conditions favourable for the specific annealing of the probes to complementary sequences in the 
nucleic acids. After incubation, the non-annealed nucleic acids are removed, and the presence of nucleic 
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acids that have hybridized to the probe if any are detected. 

The detection of nucleic acid molecules of the invention may involve the amplification of specific 
gene sequences using an amplification method such as PGR, followed by the analysis of the amplified 
molecules using techniques known to those skilled in the an. Suitable primers can be routinely designed 
5 by one of skill in the an. 

Genomic DNA may be used in hybridization or amplification assays of biological samples to 
detect abnormalities involving hzforhhl structure, including point mutations, insenions, deletions, and 
chromosomal rearrangements. For example, direct sequencing, single stranded conformational 
polymorphism analyses, heteroduplex analysis, denaturing gradient gel electrophoresis, chemical mismatch 
10 cleavage, and oligonucleotide hybridization may be utilized. 

Genotyping techniques known to one skilled in the art can be used to type polymorphisms that are 
in close proximity to the mutations in a hzf or hhl gene. The polymorphisms may be used to identify 
individuals in families that are likely to carry mutations. If a polymorphism exhibits linkage disequalibrium 
with mutations in the hzfor hhl gene, it can also be used to screen for individuals in the general population 
15 likely to carry mutations. Polymorphisms which may be used include restriction fragment length 

polymorphisms (RFLPs), single-base polymorphisms, and simple sequence repeat polymorphisms (SSLPs). 

A probe of the invention may be used to directly identify RFLPs. A probe or primer of the 
invention can additionally be used to isolate genomic clones such as YACs, BACs, PACs, cosmids, phage 
or plasmids. The DNA in the clones can be screened for SSLPs using hybridization or sequencing 
2 0 procedures. 

Hybridization and amplification techniques described herein may be used to assay qualitative and 
quantitative aspects of hzfor hhl expression. For example, RNA may be isolated from a cell type or tissue 
known to express hzfor hhl and tested utilizing the hybridization (e.g. standard Northern analyses) or PGR 
techniques referred to herein. The techniques may be used to detect differences in transcript size that may 

2 5 be due to normal or abnormal alternative splicing. The techniques may be used to detect quantitative 

differences between levels of full length and/or alternatively splice transcripts detected in normal 
individuals relative to those individuals exhibiting symptoms of a disease. 

The primers and probes may be used in the above-described methods in situ i.e directly on tissue 
sections (fixed and/or frozen) of patient tissue obtained from biopsies or resections. 

3 0 Oligonucleotides or longer fragments derived from any of the nucleic acid molecules of the 

invention may be used as targets in a microarray. The microarray can be used to simultaneously monitor 
the expression levels of large numbers of genes and to identify genetic variants, mutations, and 
polymorphisms. The information from the microarray may be used to determine gene function, to 
understand the genetic basis of a disorder, to diagnose a disorder, and to develop and monitor the activities 
35 of therapeutic agents. 

The preparation, use, and analysis of microarrays are well known to a person skilled in the art. 
(See, for example, Brennan, T. M. et al. (1995) U.S. Pat. No. 5,474,796; Schena, et al. (1996) Proc. Natl. 
Acad. Sci, 93:10614-10619; Baldeschweiler et al. (1995), PCT Application W095/251 1 16; Shalon, D. et 
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al. (I 995) PCT application WO95/35505; Heller, R. A. et al. (1997) Proc. Natl. Acad. Sci. 94:2150-2155; 
and Heller, M. J. et al. (1997) U.S. Pat. No. 5.605.662.) 
Methods for Detecting Polypeptides 

Antibodies specifically reactive with a Hzf or Hhl Polypeptide, a Hzf or Hhl Related Polypeptide. 
5 or derivatives, such as enzyme conjugates or labeled derivatives, may be used to detect Hzf or Hhl 

Polypeptides or Hzf or Hhl Related Polypeptides in various biological materials. They may be used as 
diagnostic or prognostic reagents and they may be used to detect abnormalities in the level of Hzf or Hhl 
Polypeptides or Hzf or Hhl Related Polypeptides, expression, or abnormalities in the structure, and/or 
temporal, tissue, cellular, or subcellular location of the polypeptides. Antibodies may also be used to screen 

1 0 potentially therapeutic compounds in vitro to determine their effects on a condition such as a hemaiopoietic 

disorder. In vitro immunoassays may also be used to assess or monitor the efficacy of particular therapies. 
The antibodies of the invention may also be used in vitro to determine the level of Hzf or Hhl Polypeptide 
or Hzf or Hhl Related Polypeptide expression in cells genetically engineered to produce a Hzf or Hhl 
Polypeptide, or Hzf or Hhl Related Polypeptide. 

1 5 The antibodies may be used in any known immunoassays that rely on the binding interaction 

between an antigenic determinant of a polypeptide of the invention, and the antibodies. Examples of such 
assays are radioimmunoassays, enzyme immunoassays (e.g. ELISA). immunofluorescence, 
immunoprecipitation, latex agglutination, hemagglutination, and histochemical tests. The antibodies may 
be used to detect and quantify polypeptides of the invention in a sample in order to determine their role in 

2 0 particular cellular events or pathological states, and to diagnose and treat such pathological states. 

In particular, the antibodies of the invention may be used in immuno-histochemical analyses, for 
example, at the cellular and sub-subcellular level, to detect a polypeptide of the invention, to localise it to 
particular cells and tissues, and to specific subcellular locations, and to quantitate the level of expression. 
Cytochemical techniques known in the art for localizing antigens using light and electron 

2 5 microscopy may be used to detect a polypeptide of the invention. Generally, an antibody of the invention 

may be labeled with a detectable substance and a polypeptide may be localised in tissues and cells based 
upon the presence of the detectable substance. Various methods of labeling polypeptides are known in the 
art and may be used. Examples of detectable substances include, but are not limited to. the following: 
radioisotopes (e.g.. ^ H, C, ^^S. ^^^I, *^*I), fluorescent labels (e.g., FITC. rhodamine, lanthanide 
30 phosphors), luminescent labels such as luminol; enzymatic labels (e.g.. horseradish peroxidase, P- 

galactosidase, luciferase, alkaline phosphatase, acetylcholinesterase), biotinyl groups (which can be 
detected by marked avidin e.g., streptavidin containing a fluorescent marker or enzymatic activity that can 
be detected by optical or calorimetric methods), predetermined polypeptide epitopes recognized by a 
secondary reporter (e.g., leucine zipper pair sequences, binding sites for secondary antibodies, metal 

3 5 binding domains, epitope tags). In some embodiments, labels are attached via spacer arms of various 

lengths to reduce potential steric hindrance. Antibodies may also be coupled to elecdron dense substances, 
such as ferritin or colloidal gold, which are readily visualised by electron microscopy. 

The antibody or sample may be inunobilized on a carrier or solid support which is capable of 
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immobilizing ceils, antibodies, etc. For example, the carrier or support may be nitrocellulose, or glass, 
poly aery lamides. gabbros, and magnetite. The support material may have any possible configuration 
including spherical (e.g. bead), cylindrical (e.g. inside surface of a test tube or well, or the external surface 
of a rod), or flat (e.g. sheet, test strip). Indirect methods may also be employed in which the primary 
5 antigen-antibody reaction is amplified by the inu*oduction of a second antibody, having specificity for the 

antibody reactive against a polypeptide of the invention. By way of example, if the antibody having 
specificity against a polypeptide of the invention is a rabbit IgG antibody, the second antibody may be goat 
anti-rabbit gamma-globulin labeled with a detectable substance as described herein. 

Where a radioactive label is used as a detectable substance, a polypeptide of the invention may 

10 be localized by radioautography. The results of radioautography may be quantitated by determining the 

density of particles in the radioautographs by various optical methods, or by counting the grains. 
Methods for Identifying or Evaluating Substances/Compounds 

The methods described herein are designed to identify substances or compounds that modulate 
the biological activity of a Hzf or HHl Polypeptide or Hzf or HHl Related Polypeptide. **Modulate" refers 

15 to a change or an alteration in the biological activity of a polypeptide of the invention. Modulation may be 

an increase or a decrease in activity, a change in characteristics, or any other change in the biological, 
functional, or inununological properties of the polypeptide. 

Substances and compounds identified using the methods of the invention include but are not 
limited to peptides such as soluble peptides including Ig-tailed fusion peptides, members of random peptide 

2 0 libraries and combinatorial chemisu-y-derived molecular libraries made of D- and/or L-configuration amino 

acids, phosphopeptides (including members of random or partially degenerate, directed phosphopeptide 
libraries), antibodies [e.g. polyclonal, monoclonal, humanized, anti-idiotypic, chimeric, single chain 
antibodies, fragments, (e.g. Fab, F(ab)2, and Fab expression library fragments, and epitope-binding 
fragments thereof)], and small organic or inorganic molecules. A substance or compound may be an 

2 5 endogenous physiological compound or it may be a natural or synthetic compound. 

Substances which modulate a Hzf or Hhl Polypeptide or Hzf or Hhl Related Polypeptide can be 
identified based on their ability to associate with a Hzf or Hhl Polypeptide or Hzf or Hhl Related 
Polypeptide. Therefore, the invention also provides methods for identifying substances which associate with 
a Hzf or Hhl Polypeptide or Hzf or Hhl Related Polypeptide, Substances identified using the methods of 

3 0 the invention may be isolated, cloned and sequenced using conventional techniques. A substance that 

associates with a polypeptide of the invention may be an agonist or antagonist of the biological or 
immunological activity of a polypeptide of the invention. 

The term "agonist", refers to a molecule that increases the amount of, or prolongs the duration of. 
the activity of the polypeptide. The term "antagonist" refers to a molecule which decreases the biological 
35 or immunological activity of the polypeptide. Agonists and antagonists may include proteins, nucleic acids, 

carbohydrates, or any other molecules that associate with a polypeptide of the invention. 

Substances which can associate with a polypeptide of the invention may be identified by reacting 
the polypeptide with a test substance which potentially associates with the polypeptide, under conditions 
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which permit ihe association, and removing and/or detecting the associated polypeptides and substance. 
Subsiance-polypepiide complexes, free substance, non-complexed polypeptide, or activated polypeptide 
may be assayed. Conditions which permit the formation of complexes may be selected having regard to 
factors such as the nature and amounts of the substance and the polypeptide. 
5 Substance-polypepiide complexes, free substances or non-complexed polypeptides may be 

isolated by conventional isolation techniques, for example, salting out, chromatography, electrophoresis, 
gel filtration, fractionation, absorption, polyacrylamide gel electrophoresis, agglutination, or combinations 
thereof. To facilitate the assay of the components, antibody against the polypeptide or the substance, or 
labelled polypeptide, or a labelled substance may be utilized. The antibodies, polypeptides, or substances 

1 0 may be labelled with a detectable substance as described above. 

A polypeptide, or the substance used in the method of the invention may be insolubilized. For 
example, the polypeptide, or substance may be bound to a suitable carrier such as agarose, cellulose, 
dextran, Sephadex, Sepharose, carboxymethyl cellulose polystyrene, filter paper, ion-exchange resin, plastic 
film, plastic tube, glass beads, polyamine-methyl vinyl-ether-maleic acid copolymer, amino acid copolymer, 

1 5 ethylene-maleic acid copolymer, nylon, silk. etc. The carrier may be in the shape of. for example, a tube, 

test plate, beads, disc, sphere etc. The insolubilized polypeptide or substance may be prepared by reacting 
the material with a suitable insoluble carrier using known chemical or physical methods, for example, 
cyanogen bromide coupling. 

The invention also contemplates a method for evaluating a compound for its ability to modulate 

20 the biological activity of a polypeptide of the invention, by assaying for an agonist or antagonist (i.e. 

enhancer or inhibitor) of the association of the polypeptide with a substance that associates with the 
polypeptide. The basic method for evaluating if a compound is an agonist or antagonist of the association 
of a polypeptide of the invention and a substance that associates with the polypeptide, is to prepare a 
reaction mixture containing the polypeptide and the substance under conditions which permit the formation 

25 of substance- polypeptide complexes, in the presence of a test compound. The test compound may be 

initially added to the mixture, or may be added subsequent to the addition of the polypeptide and substance. 
Control reaction mixtures without the test compound or with a placebo are also prepared. The formation 
of complexes is detected and the formation of complexes in the control reaction but not in the reaction 
mixture indicates that the test compound interferes with the interaction of the polypeptide and substance. 

3 0 The reactions may be carried out in the liquid phase or the polypeptide, substance, or test compound may 

be immobilized as described herein. 

It will be understood that the agonists and antagonists i.e. inhibitors and enhancers that can be 
assayed using the methods of the invention may act on one or more of the binding sites on the polypeptide 
or substance including agonist binding sites, competitive antagonist binding sites, non-competitive 

3 5 antagonist binding sites, or allosteric sites. 

The invention also makes it possible to screen for antagonists that inhibit the effects of an agonist 
of the interaction of the polypeptide with a substance which is capable of binding to the polypeptide. Thus, 
the invention may be used to assay for a compound that competes for the same binding site of the 
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polypeptide. 

The reagents suitable for applying the methods of the invention to evaluate compounds that 
modulate a polypeptide of the invention may be packaged into convenient kits providing the necessary 
materials packaged into suitable containers. The kits may also include suitable supports useful in 
5 performing the methods of the invention. 

Compositions and Treatments 

The polypeptides, nucleic acid molecules, substances or compounds identified by the methods 
described herein, antibodies, and anti sense nucleic acid molecules of the invention may be used for 
modulating the biological activity of a polypeptide or nucleic acid molecule of the invention. The 

1 0 polypeptides etc. may have particular application in the treatment of conditions requiring modulation of 

cells of the hematopoietic lineage i.e. hematopoietic disorders. Treatable disorders include anemia, 
thrombocytopenia, leukopenia, myelofibrosis, hypoplasia, disseminated inu*avascular coagulation, immune, 
thromocytopenic purpura, myelodysplasia, erythrocytopenia, erthordegenerative disorders, 
crythroblastopenia, leukoerylhroblastosis, erythroclasis. thalassemia, and granulocytopenia. The 

15 polypeptides etc. may be used to modulate production of blood cells in situations where a patient has a 

disease such as AIDS, which has caused alteration in normal blood cell levels, or in clinical settings in 
which enhancement of hematopoietic populations is desired, such as in conjunction with bone marrow 
transplant, or in the treatment of aplasia or myelosuppression caused by radiation, chemical treatment, or 
chemotherapy. The polypeptides, nucleic acids, etc. may also be used to treat patients in which the desired 

2 0 result is the inhibition of a hematopoietic pathway such as for the treatment of myeloproliferative or other 

proliferative disorders of blood forming organs such as thromocythemias. polycythemias, and leukemias. 

The polypeptides, nucleic acids, etc. may have specific application in the regulation of 
megakaryocytopoiesis. Abnormal enhancement of megakaryocytopoiesis leads to thrombocytosis which 
is implicated in a number of debilitating blood cell disorders, including leukemias and carcinomas, 
25 Therefore, the polypeptides, nucleic acids, etc. may be used in the prevention and treatment of 

polycythemiz vera, megakaryocytic leukemia, ovarian adenocarcinoma, and lupus erythematosus, and in 
the prevention and treatment of many common diseases where functioning of platelets requires modulation 
(e.g. inhibition) such as thrombosis, atherosclerosis, hemorrhage, embolism, myocardial infarction, post- 
myocardial infarction syndrome, and post-cardiac surgery. 

3 0 Accordingly, the polypeptides, substances, antibodies, and compounds may be formulated into 

pharmaceutical compositions for adminstration to subjects in a biologically compatible form suitable for 
administration in vivo. By "biologically compatible form suitable for administration in vivo" is meant a 
form of the substance to be administered in which any toxic effects are outweighed by the therapeutic 
effects.. The substances may be administered to living organisms including humans, and animals. 
3 5 Administration of a therapeutically active amount of the pharmaceutical compositions of the present 

invention is defmed as an amount effective, at dosages and for periods of time necessary to achieve the 
desired result. For example, a therapeutically active amount of a substance may vary according to factors 
such as the disease state, age. sex, and weight of the individual, and the ability of antibody to elicit a desired 



WOOO/49145 



PCT/CAOO/00171 



- 24 - - 

response in the individual. Dosage regima may be adjusted to provide the optimum therapeutic response. 
For example, several divided doses may be administered daily or the dose may be proportionally reduced 
as indicated by the exigencies of the therapeutic situation. 

The active substance may be administered in a convenient manner such as by injection 
5 (subcutaneous, intravenous, etc.), oral administration, inhalation, transdermal application, or rectal 

administration. Depending on the route of administration, the active substance may be coated in a material 
to protect the compound from the action of enzymes, acids and other natural conditions which may 
inactivate the compound. 

The compositions described herein can be prepared by perse known methods for the preparation 

10 of pharmaceutical ly acceptable compositions which can be administered to subjects, such that an effective 

quantity of the active substance is combined in a mixture with a pharmaceutically acceptable vehicle. 
Suitable vehicles are described, for example, in Remington's Pharmaceutical Sciences (Remington's 
Pharmaceutical Sciences, Mack Publishing Company, Easion, Pa., USA 1985). On this basis, the 
compositions include, albeit not exclusively, solutions of the substances or compounds in association with 

1 5 one or more pharmaceutically acceptable vehicles or diluents, and contained in buffered solutions with a 

suitable pH and iso-osmotic with the physiological fluids. 

After pharmaceutical compositions have been prepared, they can be placed in an appropriate 
container and labeled for treatment of an indicated condition. For administration of a polypeptide, nucleic 
acids, etc. of the invention, such labeling would include amount, frequency, and method of administration. 

2 0 The polypeptides, nucleic acids, etc. and compositions of the invention may be used alone, or in 

combination with another pharmaceutically active agent such as, for example, cytokines, neurotrophins, 
interleukins, etc. For example, the polypeptides etc. and compositions may be used in conjunction with any 
of a number of the above-referenced factors which are known to induce stem cell or other hematopoietic 
precursor proliferation, or factors acting on later cells in the hematopoietic pathway including but not 

2 5 limited to hemopoietic maturation factor, thrombopoietin, stem cell factor, erythropoietin, G-CSF, CjM-CSF 

etc. 

The invention also contemplates an antibody that specifically binds the therapeutically active 
ingredient used in a treatment or composition of the invention. The antibody may be used to measure the 
amount of the therapeutic molecule in a sample taken from a patient for purposes of monitoring the course 
30 of therapy. 

The nucleic acid molecules encoding a polypeptide of the invention or any fragment thereof, or 
antisense sequences may be used for therapeutic purposes. Antisense to a nucleic acid molecule encoding 
a polypeptide of the invention may be used in situation to block the synthesis of the polypeptide. In 
particular, cells may be transformed with sequences complementary to nucleic acid molecules encoding a 

3 5 Hzf or Hhl Polypeptide or Hzf or Hhl Related Polypeptide. Thus, antisense sequences may be used to 

modulate activity or to achieve regulation of gene function. This technology is well known in the art, and 
sense or antisense oligomers or larger fragments, can be designed from various locations along the coding 
or regulatory regions of sequences encoding a polypeptide of the invention. 
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Expression vectors may be derived from retroviruses, adenoviruses, herpes or vaccinia viruses or 
from various bacterial plasmids for delivery of nucleic acid sequences to the target organ, tissue, or cells. 
Vectors that express antisense nucleic acid sequences of Hzf or Hhl Polypeptides can be constructed using 
techniques well known to those skilled in the art. 
5 Genes encoding a Hzf or Hhl Polypeptide can be turned off by transforming a cell or tissue with 

expression vectors that express high levels of a nucleic acid molecule or fragment thereof which encodes 
a polypeptide of the invention. Such constructs may be used to introduce untranslatable sense or antisense 
sequences into a cell. Even if they do not integrate into the DNA. the vectors may continue to transcribe 
RNA molecules until all copies are disabled by endogenous nucleases. Transient expression may last for 

1 0 extended periods of time (e.g a month or more) with a non-replicating vector, or if appropriate rei)lication 

elements are part of the vector system. 

Modification of gene expression may be achieved by designing antisense molecules, DNA, RNA, 
or PNA, to the control regions of a hzf or hhl gene i.e. the promoters, enhancers, and introns. Preferably 
the antisense molecules are oligonucleotides derived from the U-anscription initiation site (e.g. between 

1 5 positions -10 and + 10 from the start site). Inhibition can also be achieved by using triple-helix basti-pairing 

techniques. Triple helix pairing causes inhibition of the ability of the double helix to open sufficiently for 
the binding of polymerases, transcription factors, or regulatory molecules (see Gee J.E. et al (1994) In: 
Huber, B.E, and B.I. Carr, Molecular and Immunologic Approaches, Futura Publishing Co., Mt. Kisco, 
N. Y.). An antisense molecule may also be designed to block translation of mRNA by inhibiting binding 

20 of the transcript to the ribosomes. 

Ribozymes, enzymatic RNA molecules, may be used to catalyze the specific cleavage of RNA. 
Ribozyme action involves sequence-specific hybridization of the ribozyme molecule to complementary 
target RNA, followed by endonucleolytic cleavage. For example, hanunerhead motif ribozyme molecules 
may be engineered that can specifically and efficiently catalyze endonucleolytic cleavage of sequences 

2 5 encoding a polypeptide of the invention. 

Specific ribosome cleavage sites within any RNA target may be initially identified by scanning 
the target molecule for ribozyme cleavage sites which include the following sequences: GUA, GUU, and 
GUC. Short RNA sequences of between 15 and 20 ribonucleotides corresponding to the region of the 
cleavage site of the target gene may be evaluated for secondary structural features which may render the 

3 0 oligonucleotide inoperable. The suitability of candidate targets may be evaluated by testing accessibility 

to hybridization with complementary oligonucleotides using ribonuclease protection assays. 

Therapeutic efficacy and toxicity may be determined by standard pharmaceutical procedures in 
cell cultures or with experimental animals, such as by calculating the ED50 ( the dose therapeutically 
effective in 50% of the population) or LD50 (the dose lethal to 50% of the population) statistics. The 
3 5 therapeutic index is the dose ratio of therapeutic to toxic effects and it can be expressed as the EDsq/LDso 

ratio. Pharmaceutical compositions which exhibit large therapeutic indices are preferred. 

The invention also provides methods for studying the function of a polypeptide of the invention. 
Cells, tissues, and non-human animals lacking in expression or partially lacking in expression of a nucleic 
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acid molecule or gene of the invention may be developed using recombinant expression vectors of the 
invention having specific deletion or insertion mutations in the gene. A recombinant expression vector 
may be used to inactivate or alter the endogenous gene by homologous recombination, and thereby create 
a deficient cell, tissue, or animal. 
5 Null alleles may be generated in cells, such as embryonic stem cells by deletion mutation. A 

recombinant gene may also be engineered to contain an insertion mutation that inactivates the gene. Such 
a construct may then be introduced into a cell, such as an embryonic stem cell, by a technique such as 
transfection, electroporation, injection etc. Cells lacking an intact gene may then be identified, for example 
by Southern blotting. Northern Blotting, or by assaying for expression of the encoded polypeptide using 

1 0 the methods described herein. Such cells may then be fused to embryonic stem cells to generate transgenic 

non-human animals deficient in a polypeptide of the invention. Germline transmission of the mutation may 
be achieved, for example, by aggregating the embryonic stem cells with early stage embryos, such as 8 cell 
embryos, in vitro; transferring the resulting blastocysts into recipient females and; generating germline 
transmission of the resulting aggregation chimeras. Such a mutant animal may be used to define specific 

15 cell populations, developmental patterns and in vivo processes, normally dependent on gene expression. 

The invention thus provides a transgenic non-human mammal all of whose germ cells and somatic 
cells contain a recombinant expression vector that inactivates or alters a gene encoding a Hzf or Hhl 
Polypeptide. In an embodiment the invention provides a transgenic non-human mammal all of whose germ 
cells and somatic cells contain a recombinant expression vector that inactivates or alters a gene encoding 

20 a Hzf Polypeptide resulting in a Hzf associated pathology. Further the invention provides a transgenic non- 

human mammal which doe not express a Hzf or Hhl polypeptide of the invention. In an embodiment, the 
invention provides a transgenic non-human mammal which doe not express a Hzf polypeptide of the 
invention resulting in a Hzf associated pathology. 

An Hzf associated pathology refers to a phenotype observed for a Hzf homozygous mutant. The 

2 5 phenotype is similar to the phenotype observed after sustained brain damage resulting from stroke in 

humans. The phenotype of an Hzf homozygous mutant mouse includes the following characteristics: 
hemorrhage frequently observed in the brain, smaller size, animals are able to lift their heads only to the 
right or left; and their bodies tremor when placed on unstable surfaces. The membranes of megakaryocytes 
from the Hzf homozygous mutant animals are also aberrant. 
30 A transgenic non-human animal includes but is not limited to mouse, rat, rabbit, sheep, hamster, 

dog, cat, goat, and monkey, preferably mouse. 

The invention also provides a transgenic non-human animal assay system which provides a model 
system for testing for an agent that reduces or inhibits a pathology associated with an Hzf or Hhl 
Polypeptide, preferably a Hzf associated pathology, comprising: 

3 5 (a) administering the agent to a transgenic non-human animal of the invention; and 

(b) determining whether said agent reduces or inhibits the pathology (e.g. Hzf associated 
pathology) in the transgenic mouse relative to a transgenic mouse of step (a) which has not 
been administered the agent. 
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The agent may be useful in the treatment of hematopoietic disorders or disorders requiring 
modulation of megakaryocytopoiesis, or it may be used to modulate production of blood cells as discussed 
herein. The agents may be particularly useful in the treatment and prophylaxis of thrombosis, hemorrhage, 
embolism, stroke, atherosclerosis, myocardial infarction, post-myocardial infarction syndrome, and post- 
5 cardiac surgery. The agents may be incorporated in a pharmaceutical composition as described herein. 

A polypeptide of the invention may be used to support the survival, growth, migration, and/or 
differentiation of cells expressing the polypeptide. Thus, a polypeptide of the invention may be used as a 
supplement to support, for example hematopoietic cells in culture. 

The following non-limiting examples are illustrative of the present invention: 
1 0 Examples 
Example 1 

MATERIAL AND METHODS 

Cells. Rl embryonic stem cells from the 129/Sv strain [34] were maintained on a layer of mitomycin C- 
treated embryonic fibroblasts in Dulbecco's modified Eagle's culture medium, supplemented with leukemia 
15 inhibitory factor, 15% fetal calf serum» L-glutamine, and P-mercaptoethanol as previously descritted [35]. 

The 0P9 stromal cell line were culnired in a-MEM containing 20% fetal calf serum [33]. 
Generation of Trapped ES Cell Lines. The plasmid PTl-ATG (PTl henceforth) contains the En-2 splice 
acceptor site positioned immediately upstream of die lacZ reporter gene with an ATG translational start site 
[36]. The bacterial neomycin- resistance {neo) gene is driven by the phosphoglycerate kinase-I (PGK-1) 

2 0 promoter. After electroporation of the ES cells with the PTl vector and selection in G418. drug-resistant 

clones were plated into 96- well plates. Upon confluency, the cells were expanded into three 96- well plates. 
One plate was frozen; cells in the second plate were stained with X-galactosidase (X-gal); and cells in the 
third plate were used for hematopoietic differentiation on OP9 cells. 

Analysis of fi-galactosidase Activity, p-galactosidase (p-gal) activity was analyzed by staining with 5- 
25 bromo 4-chloro 3-indolyl P-galactoside (X-gal; Boehringer Mannheim, Germany). Briefly, cells were 

washed once with phosphate buffered saline (PBS) and fixed in 0.2% glutaraldehyde, 2mM MgCl2 and 
5mM EGTA in lOOmM Na2HP04 for 10 min. at room temperature. Cell cultures were then washed three 
times for 5 min. each in IQOmM Na2HP04, 0.02% NP-40 and 2 mM MgCl2 and stained overnight in X-gal 
staining solution (1 mg/ml X-gaU 5 mM KsFe (CN)6, 5 mM K4Fe(CN)6, 2mM MgCl2, 0.02% NP-40 in 

3 0 lOOmM Na2HP04) directly in 96- or 6- well plates at 37*^C. Following X-gal staining, samples were washed 

overnight at 4*^C. Detection of p-gal activity in embryos was performed as described above except the 
fixative included 1.5% formaldehyde and fixation was performed for between 30-60 min. depending on the 
size of the embryo and washed 3 times each for 30 min. 

Screening Clones for Hematopoietic Expression, Differentiation of ES cells on OP9 stromal cells was 
3 5 performed essentially as previously described [33]. The drug-resistant clones within each row of a 96- well 

plate which did not demonstrate X-ga) staining in undifferentiated cultures (white clones) were pooled. The 
pooled clones were seeded onto confluent 0P9 cell layers in 6-well plates at a density of lO'* cells/well in 
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0P9 media. Cultures were trypsinized at day 5, and 10^ cells/well were iransfened onto fresh 0P9 su-oma. 

Mesoderm- like colonies and hematopoietic cell clusters were observed on day 5 and day 10, respectively. 

tflcZ expression was examined at day 5 and day 10. The pooled clones that demonstrated P-gal activity 
during the first round of screening were thawed from frozen stocks, and each clone was tested individually 
5 for expression of lacZ during hematopoietic differentiation. Clones which demonstrated X-gal staining as 

undifferentiated cells were analysed individually on 0P9 cells. 

Molecular cloning of the trapped genes by RACEy inverse PCR and cDNA library screening, RNA was 
prepared from either undifferentiated or differentiated ES cells using Trizol (Gibco/BRL) according to 
manufacturer's instructions. 5* RACE was performed using the 5' RACE kit (Gibco/BRL), according to 

1 0 manufacturer's instructions with modifications previously described'^. 5' RACE products were subcloned 

into the CloneAmp plasmid (Gibco/BRL) and both strands sequenced using the AutoRead Sequencing kit 
(Pharmacia) and an A.L.F. DNA Sequencer (Pharmacia). Sequences were analyzed by comparison to the 
non-redundant GenBank and EST databases of the NCBI using the BLASTN program. 3' RACE was 
performed using a 3' RACE kit (Life Technologies) and two nested primers complimentary to each 5' 

1 5 RACE fragment (see Figure 5). Wild type total RNA from day 9 embryos or mouse brain was used as 

template. Specific primers were as follows: 12G10-5. 5-GCAGAGCTCTGGGCGGCGGGT-3*; 12G10-6. 
5'-CTCGGGCGTCTGACAGAGTTG-3*; 4H5-1, 5'-CCCGGGACAGAACGACGC-3'; 4H5-2, 5- 
CGGAACCCCGCjGAGCCAGG-3' (SEQ ID NOs. 5-8, respectively). To obtain a long amplified product, 
the Expand™ long template PCR system (Boehringer Mannheim) was used. The RACE-PCR products 

2 0 were cloned into the pCR2.1 vector (Invitrogen). The cDNA inserts were sequenced on both strands as 

previously described. Inverse PCR was performed essential as described by von Melchner et aL, 
(1990)". Genomic DNA from the Hhl ES cell line was digested with PstI and self-ligated at a 
concentration of Ifxg/ml to obtain circular molecules. 20ng of self-ligated circular DNA was used for the 
PCR using the Expand™ long template PCR system (Boehringer Mannheim), with primer sets of GTlacZ- 
25 3 [5'-ATTACGCCAC}CTGGCGAAAGG-31 (SEQ.ID.N0.9) and GTlacZ-4 [5'- 

GATTGGTC5GCGACGACTCCTG-31 (SEQ.ID.NO.IO). Following digestion with Pstl and Kpnl to 
isolate flanking sequences just upstream of vector integration sites, the DNA was separated 
electophoretically on a 0.7% agarose gel and hybridized with a pGEM7 probe. 

A 12.5 d.p.c. mouse embryonic cDNA library in the AEXIox vector (mouse strain NIH Swiss; 

3 0 Novagen) was screened by standard procedures [38] using Hhl RACE fragments as probes. Phage plaques 

(1.0x10^) were transferred to nylon filters (Amersham) and hybridized for 2hr at 68^C with the 
radiolabeled probe using Quikhyb solution (Stratagene). Filters were washed at 60^C: twice for 15 min 
in 2x SSC and for 1 hour in O.lx SSC containing 0.1% SDS. Positive clones were plaque purified and the 
inserts recovered and sequenced as described above. GenBank search and sequence analysis were 
3 5 performed using Fasta, Translate, Bestfit, and Pileup softwares from the Wisconsin Package ((jenetics 

Computer Group. Inc., Madison, WI). The ORFs of Hhl and Hzf were submitted to the National Center 
for Biotechnology Information XREF internet service (http://www.ncbi.nlm.nih.gov/XREFdb/) where the 



\VO00/49145 



PCT/CAOO/00171 



- 29 - - 

sequences were compared to the most recent updates of the EST database. In addition, the sequences were 
compared to all public databases using the BLAST algorithm. 

Generation of Chimeras. The ES cell clones. Hhi and Hzf were aggregated with CDl diploid embryos 
as previously described [34] and transferred to foster mothers to generate several strong male chimeras. 
5 Chimeric males were bred to CDl females and tail DNA of Fi and F2 offspring was analyzed by southern 

blotting and hybridization to En-2/LacZ and RACE fragment probes. For the Hhl clone, genotyping was 
performed using densitometry, comparing the intensity of the LacZ band to the internal standard, £/i-2. The 
reliability of genotyping by quantitative Southern blot analysis was confirmed by test breeding [29]. For 
the Hzf line, EcoRI-digested DNA was hybridized with the RACE probe which detected a polymorphism 
1 0 between the wild-type and mutant alleles. All the analyses described were performed in the CD- l/129Sv 

hybrid background. 

RNA Analysis, Northern blot analysis was carried out according to standard procedures [38]. RT-PCR was 
performed using an RNA PCR kit (Perkin-Elmer Cetus) using total RNA from the brains of F2 mice as 
template. Combinations of specific primers were as follows: 12G10-6; 12G10-14, 5- 
15 TATCTTCAGCTGTGGCTTCCC-3* (SEQ.ID.NO.il); GT-/acZ-2A 5'-CCGTCGACTC 

TGGCGCCGCTGCTCTGTCAG-3'(SEQ.ID.N0.12); for amplification of Hhl products and 4H5-2, 5'- 
CGGAACCCCGGGAGCCAGG-3(SEQ.ID.N0. 13); 4H5-6, 5'-TCTGGGGATCCTGGAGCTGGAC- 
3'(SEQ.ID.N0.14); GT-/^jcZ-2A for Hzf products. 

Immunofluorescence. Cells were stained with the fluorescent P-gal substrate, fluorescein di-g-D- 
20 galactopyranoside (FDG) as previously described [39]. Cells (10^) were washed twice with phosphate 

buffered saline (PBS) and resuspended in 40|il PBS containing 5% FCS. After 10 min of preincubation 
at 37OC, cells were incubated with 4Qp\ prewarmed FDG (2mM in water) for 75 seconds at 37^C. FDG 
loading was terminated by adding 720^1 ice cold PBS containing 5% FCS. Cells were cytospun onto glass 
slides and examined under a fluorescent microscope, prior to morphological examination by Wright-Giemsa 

2 5 staining. Cell surface expression of CD61 was perfomed by indirect immunofluorescence. Bone marrow 

cells were washed in PBS and incubated with anti-CD61 (PharMingen, ON, CA) for 30 minutes, washed 
twice, then incubated for 30 minutes with fluorescein isothiocyanate (FITC) conjugated mouse anti-hamster 
IgG monoclonal antibody (PharMingen, ON. CA). Cells were washed twice and analysis was performed 
using a FACS cell sorter (Becton and Dickinson, Mountain View, CA) 

3 0 CFU-C Analysis, Bone marrow cells were cultured in methylcellulose containing the growth factors IL-3. 

IL-6, Epo and SCF (METHOCULT GF M3434; Stemcell Technologies). After 7-16 days, individual 
colonies were picked and stained with X-gal in a 96-well plate. Colony types were determined 
morphologically using Wright-Giemsa staining. For analysis of CFU-Mk, bone marrow cells were plated 
in 0.3% MegaCult serum-free base agarose (StemCell Technologies HCC-4701K) in the presence of 
3 5 recombinant murine IL-3 (20ng/ml), TPO (40ng/ml) and SLF (50ng/ml) [40] and cultured for 18-21 days. 

The complete culmre was stained with X-gal and CFU-Mk were examined microscopically. 
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RESULTS 

Isolation of hematopoietic gene trap clones. The PTl gene trap vector, which contains a splice acceptor 
site immediately upstream of a promoterless lacZ reporter gene and the neo gene driven by the PGK-J 
promoter, was introduced into ES cells (clone Rl) by electroporation. After selection of neo^ colonies. 
5 clones were replica plated for freezing, analysis of lacZ expression by undifferentiated ES cells, and 

differentiation on OP9 stromal cells. Of 1.350 neo"* ES clones, 95% did not express lacZ (white clones) 
and 5% were lacZ positive (blue clones) in the undifferentiated state. Both white and blue clones were 
allowed to differentiate on 0P9 cells and lacZ expression was monitored throughout hematopoietic 
differentiation. Two of the originally white clones (0.2% of the total number of clones) now expressed lacZ 

10 upon hematopoietic differentiation (Table 1). One of these clones, 7H7, expressed lacZ at day 5 in 

mesoderm-like colonies which was subsequently downregulated during differentiation and not detected in 
hematopoietic cells by day 10 of culture. The other clone, Hhl, expressed lacZ at both the mesodermal and 
hematopoietic stages of differentiation (Figure 1). Approximately one-third of the hematopoietic clusters 
derived from Hhl ES cells expressed lacZ 12 days after intitiation of co-culture. 

15 Of the 70 clones that expressed lacZ in the undifferentiated state (Table 1), 97% (68/70) 

downregulated /acZ expression upon differentiation while two of the originally blue clones, Hzf and 4A11, 
expressed lacZ throughout the differentiation into hematopoietic cells (Table 1). Clone Hzf expressed lacZ 
in only a few cells at the undifferentiated (day 2) and mesodermal stages (day 5). Interestingly, lacZ 
expression in Hzf was restricted to large cells located at the periphery of day 12 hematopoietic clusters 

2 0 (Figure 1). Clone 4A11 was subsequently shown to be ubiquitously expressed in vivo and was not 

characterized further. The two hematopoietic regulated gene trap clones Hzf and Hhl are described below. 
In vivo Expression Analysis. Previously it has been shown that the in vitro patterns of lacZ expression 
in trapped ES clones accurately predicts the expression patterns observed in vivo [18]. To determine 
whether the in vitro expression patterns of the trapped genes in the ES clones Hhl and Hzf were also 

2 5 recapitulated in vivo, chimeric mice were generated between these ES clones and diploid embryos; and the 

trapped alleles were transmitted through the germ line to produce Fj heterozygotes. In Hhl heterozygotes 
(HhU+), lacZ was expressed in the yolk sac and heart primordium at 8.5 days postcoitum (d.p.c) and in the 
heart, dorsal root ganglia and fetal liver after 12.5 d.p.c (Figure 2A & B). Virtually all cells in 14.5 d.p.c 
fetal liver were positive for X-gal staining (Figure 2C), and lacZ expression in the heart was localized to 

3 0 the endocardium and myocardium (Figure 2D). In Hzf heterozygotes {Hzfl+), lacZ was expressed in the 

somites, basal ganglia, apical ectodermal region of the limb buds and liver primordium in 9.5 d.p.c. 
embryos (Figure 2E). Expression was also observed in the skin (Figure 2H), trigeminal ganglia, thymus, 
salivary gland and spinal cord, with punctate staining in the fetal liver around 14.5 d.p.c (Figure 2F). The 
punctate staining in the liver was due to the restricted staining of large polynuclear cells resembling 
3 5 megakaryocytes (Figure 20). 

To characterize further the expression pattern of the trapped genes within the hematopoietic 
compartment, bone marrow cells were isolated from adult mice and analyzed for lacZ expression. 
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Unfraciionaied cells obtained from HzJI-^ mice expressed high levels of lacZ in about 2% of total bone 
marrow cells while up to 80% (ranging from very high to low levels of expression) of cells obtained from 
mice were /acZ-positive (for example, see Figures 3A & B). Both lines demonstrated a coincidence 
of /acZ expression, detected by FDG staining, within cells morphologically resembling megakaryocytes 
5 (Figures 3C & D). 

This pattern of lacZ expression was further supported by the expression of lacZ within most 
megakaryocyte colony forming units (CFU-Mk) generated from Hhlf-k- and //^+ bone marrow in a serum- 
free agarose assay in the presence of TPO, lL-3 and SLF (Figure 4). In addition to expression in 
megakaryocytes, lacZ was expressed in a population of cells within CFU-M and CFU-GEMM colonies, 

10 with no expression observed in CFU-GM, G or BFU-E colonies derived from bone marrow cells 

(Figure 4). In contrast, lacZ was expressed to high levels in BFU-E and cells within CFU-GEMM colonies 
from Hhlh bone marrow cells. Expression was also observed in some cells within CFU-M and the 
macrophage component of CFU-GM (Figure 4); however, granulocytes did not express lacZ. Thus, the in 
vitro hematopoietic expression of both Hhl and Hzf was recapitulated in vivo, 

15 To define more precisely the cells that expressed lacZ, bone marrow cells were selected on the 

basis of their expression of the megakaryocyte cell surface marker, CD61, by fluorescence-activated cell 
sorting. Approximately 5% of bone marrow cells derived from //^+ mice expressed lacZ. About 0.5-1% 
of the /^cZ-positive cells did not express CD6L Within the CD61* fraction obtained from bone 
marrow cells, approximately 14% of the cells expressed lacZ whereas 33% demonstrated lacZ expression 

20 in the CD6r fraction. 

Sequence analysis oiHzf and HhL To determine the primary sequence of the Hrfand Hhl trapped genes, 
lacZ fusion transcripts were cloned by 5' rapid amplification of cDNA ends (RACE; Figure 5). Primers 
corresponding to the sequence obtained by 5 'RACE were used for subsequent 3 -RACE to obtain ftirther 
sequence downstream of the vector integration site (Figure 5 & 6). The full length coding sequence for Hzf 

2 5 was obtained by 3' RACE (Figure 6B .). A combination of RACE, inverse PCR and cDNA library screening 

was used to obtain partial sequences of Hhl with a full length coding sequence being obtained using 3' 
RACE (Figure 6A). 

Hhl. The Hhl sequence is 1,258 base pairs (bp) in length and contains an uninterrupted open reading frame 
(ORF) which encodes a putative polypeptide of 298 amino acids (Figure 6A). The sequence of this ORF 

3 0 does not share significant similarity with any known genes; however, the 3' end of Hhl shares 88% DNA 

similarity over 126 bases to a murine expressed sequence tag (EST) isolated from a heart cDNA library 
(accession # AA919544) using the NCBI search program. Analysis of the translated sequence demonstrated 
significant homology (33.47%, determined by using Bestfit and GAP programs from GCG) to a peptide 
encoded by a series of C.elegans cDNAs (accession # U41540) using Psi blast from the NCBI search 
3 5 program as well as FASTA3 from the ExPasy Program. The cloned Hhl cDNA may not contain the full- 

length message because it contains no classical AATAAA polyadenylation signal apparent in its 3' UTR. 
Analysis of the translated sequence using ProfileScan from the ExPasy Program identified a putative 
phosphotyrosine binding domain between amino acids 78-192. 
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Hzf. The Hzf sequence is 2.300 bp in length and contains a 396 amino acid ORF with a putative u-anslation 
start site at nucleotide 58 embedded in a consensus Kozak sequence. There is a single AATAAA 
polyadenylation signal in the SITTR. Sequence analysis demonstrated that Hzf \% a novel gene encoding 
a polypeptide that includes 3 zinc finger motifs of the C2H2 type (Figure 6B). The Hzf putative peptide has 
5 sequence similarity (22-32%) with several C2H2 type zinc finger containing polypeptides, including the 

Xenopus laevis dsRBP-Zfa (41), the murine p53-inducible zinc finger polypeptide, Wig-l (42) and its rat 
homolog PAG608 polypeptide (43). 

RNA in situ hybridization and northern analysis. To determine whether the expression pattern oUacZ 
described above reflected the endogenous expression pattern of the trapped genes, the X-gal staining 

1 0 pattern in 14.5 d.p.c heterozygous embryos was compared with the pattern of expression observed by RNA 

in situ hybridization analysis in wild type 14.5 d.p.c embryos. In general, the RNA in situ signals for both 
Hzf and Hhl coincided with the patterns observed by staining for lacZ expression (data not shown). 
Northern blot analysis revealed Hhl transcripts of 3 different sizes in various tissues, suggesting that Hhl 
undergoes differential splicing in a tissue-specific manner in adult tissues (Figure 7A). A single 2.3kb Hzf 

15 message was observed in the bone marrow and brains of adult mice and at lower levels in the thymus 

(Figure 7B). 

Mice homozygous for the Hzf and Hhl gene trap insertion are viable and fertile. Hzf and Hhl 

heterozygous mice do not exhibit any apparent abnormalities and are fertile. Intercrosses of F] heterozygous 
mice for each of the Hhl and Hzf lines generated viable homozygous offspring at the appropriate Mendelian 

2 0 ratios. Mice homozygous for either insertion developed normally and did not exhibit any overt phenotypc, 

including the numbers of types of hematopoietic progenitor cells, as determined by in vitro colony assays 
(data not shown). The absence of a discernible phenotype in these mice might reflect the absence of any 
effect on gene expression as the result of the integration of the PTl vector into the Hhl and Hzf genes. To 
test this possibility, expression of both Hzf and Hhl RNA u-anscripts was analyzed in the brains of wild type 
25 and homozygous mice by RT-PCR using primers that span the integration site. For both //i/and Hhl, cDNA 

was amplified from both Hrf and Hhl homozygotes, suggesting that transcripts that extend 3' to the 
integration site were present (Figure. 8). This conclusion was further confirmed by Northern blot analysis 
using probes 3' to the site of vector integration. This analysis demonstrated no differences in the levels or 
size of the transcripts obtained firom wild-type or homozygous mice. As shown in Figure 6A, vector 

3 0 integration within the Hhl locus was within the 5' UTR upstream of the first coding exon. For H^^f, vector 

integration occurred downstream of the translational start site but upstream of an additional start site which 
also has a good Kozak consensus; therefore, alternative initiation codon usage may have occurred (Figure 
6B). Thus, the sites of vector integration could account for the lack of mutagenesis of these trapped loci. 
Alternatively, splicing around the gene U-ap vector may have occurred which would also result in the 
3 5 expression of endogenous transcript. The latter has been reported to occur for a number of trapped genes 

[44,45.46,47,48,49] and splicing around targeting constructs has also been shown to occur, resulting in the 
generation of partially functional polypeptides rather than the expected null mutation [501. 
DISCUSSION 
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Novel genes regulated during hematopoietic development were identified and characterized. ES 
cell clones containing a random gene trap insertion were induced to differentiate into hematopoietic cells 
by co-culture on 0P9 stromal cells. Clones were screened for /flcZexpressioii in undifferentiated cells as 
well as at various stages during hematopoietic differentiation. From a total of 1,350 ES clones, three clones 
5 exhibited lacZ expression within hematopoietic cells and two of these clones displayed a regulated 

expression pattern in vitro which was recapitulated during hematopoietic development in vivo. Molecular 
cloning and cDNA sequence analysis of the trapped genes in these two ES clones revealed that both genes, 
which have been denoted as //^and Hhl, are novel. Thus, this in vitro expression-based gene trap strategy 
is a successful approach for screening for novel hematopoietic genes. 

10 In undifferentiated Hzf ES cells, iacZ expression was observed in only a few cells and this low 

frequency of expression was maintained throughout hematopoietic differentiation in vitro. The expression 
of Hhl was upregulated being first detected in mesodermal colonies and later expressed in die majority of 
hematopoietic cells by day 12. During embryogenesis, neither H^f or Hhl was expressed in hematopoietic 
cells within blood islands in the extraembryonic yolk sac, the site of primitive hematopoiesis. Neither gene 

15 was expressed within the intraembryonic para-aortic splanchnopleur/aorta-gonad-mesonephros (p- 

Sp/AGM) region, the site where definitive hematopoiesis is thought to be initiated [51,52]. This lack of 
detectable Hhl or //^expression in these hematopoietic compartments does not exclude the possibility that 
cells within these regions express these genes. It is possible that in these compartments only a low 
frequency of cells express H^ot Hhl and/or expression in cells within these regions is below the level of 

2 0 detection.. Expression of both trapped genes in hematopoietic cells was observed first in the fetal iiver and 

subsequently adult bone marrow. The low frequency of f/^expression observed in vitro was recapitulated 
in vivo in the embryonic liver and in only 2 9o of adult bone marrow cells. The higher frequency of Hhl 
expression within day 12 hematopoietic cells in vitro was also recapitulated in vivo where 80% of adult 
bone marrow cells expressed lacZ. The majority of cells expressing Hrf were megakaryocytes, as 

25 determined by morphology, lacZ expression within CFU-Mk, and cell surface expression of the 

megakaryocyte-specific surface marker, CD61, Hrfwas also expressed in other hematopoietic cell lineages 
contained within CFU-GEMM and at a very low frequency in CFU-M. Hhl was more widely expressed than 
H:f within hematopoietic lineages, particularly in erythroid and megakaryocyte cells as well as in the 
majority of cells comprising Cnj-GEMM. 

^ 0 Both Hrf and Hhl were expressed in megakaryocytes, suggesting that these novel genes rnay play 

a role in the differentiation and/or maturation of this cell lineage. The major determinants of commitment 
to the megakaryocyte lineage and the regulation of gene expression in this lineage remain unclear. The 
generation of several knock-out mice have aided in our understanding of megakaryocyte differentiation. 
Mice lacking thrombopoietin (TPO) or its receptor c-Mpl have reduced numbers of megakaryocytes and 

35 as a consequence are also thrombocytopenic [8,53]. These mice also have reduced numbers of 

hematopoietic progenitors of multiple lineages [54,55]. Mice with a megakaryocyte-selective loss of 
GATA-1 and NF-E2 null mice are also thrombocytopenic as a result of arrested megakaryocyte 
differentiation and maturation [56,57]. Mice lacking GATA-1 or NF-E2 also display defects in 
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erythropoiesis [58.57], suggesting that these transcription factors are co-expressed within a bipotential stem 
cell capable of giving rise to both megakaryocytes and erythroid cells. Recently, it has been shown that the 
multiple zinc fmger polypeptide. Friend of GATA-1 (FOG), cooperates with GATA-1 to promote erythroid 
and megakaryocyte differentiation [59]. In mice lacking FOG due to a targeted null mutation in ES cells, 
5 the differentiation of erythroid precursors is blocked [60]. However, in contrast to GATA-1 deficien: mice. 

megakaryocytes are completely absent, suggesting a GATA-1 independent role for FOG perhaps at the very 
earliest stage of megakaryocyte differentiation from a bipotential erythroid/megakaryocyte progenitor. In 
this study, Hrf expression was observed in megakaryocytes but not in erythroid cells, suggesting that the 
function of this novel gene may be restricted to the megakaryocyte lineage. The presence of three zinc- 

1 0 fmger motifs in Hzf suggests that it may also be involved in transcriptional regulation in megakaryocytes. 

In contrast, Hhl was strongly expressed in both erythroid and megakaryocyte lineages. Given that both Hzf 
and Hhl were also expressed within multipotential CFU-GEMM colonies, these novel genes may be 
expressed in a bi/multipotential progenitor. 

In addition to expression in hematopoietic cells, Hhl was also expressed in the embryonic and 

1 5 adult heart and Hhl has significant homology to an EST isolated from a heart cDNA library. Nortliem blot 

analysis of adult mouse tissue also demonstrated Hhl transcripts in the brain and kidney. In adult tissue, Hzf 
was also expressed in the brain. Thus, these genes may also be involved in other developmental processes 
outside of the hematopoietic system. 

The gene trap strategy resulted in the identification of two novel genes with coincidence of 

20 expression within megakaryocytes. Insertion of the gene trap vector into //^and Hhl has allowed the 

molecular cloning of both genes and has also facilitated the rapid analysis of their expression during mouse 
embryogenesis. 
Example 2 

Mice with true null alleles for Hzf were generated, A targeting vector was designed to replace a 
25 5.5kb genomic fragment containing exons encoding 3 zinc fmger domains with a IRES LacZ and a 

neomycin resistance gene. The IRES LacZ and Neo resistance gene was inserted in the targeting vector in 
the sense orientation to the Hzf transcript. The targeting vector was electroporated into Rl ES cells. After 
positive and negative selection, 6 independent clones were screened after checking for homologous 
recombination using 5 prime and 3 prime flanking probes. A Neo probe was also used to check single 
3 0 integration. Three out of 6 clones were aggregated, resulting in germline transmission. Heterozygous 

offspring were interbred to generate homozygotes. The total number of new bom pups decreased at 3 
weeks after birth. In order to determine the stage of neonatal development affected by the Hzf mutation, 
neonatal genotyping was performed. One week old Hzf -/- pups were viable, usually of normal size and 
occurred at the expected Mendelian frequency. However, thereafter, typically from 2 to 3 weeks after birth, 
3 5 the percentage of homozygotes was decreased. Most of the Hzf -/- showed behavioral abnormalities. Six 

out of 24 Hzf -/- mice were able to lift their heads only to the left side. Seventeen out of 24 exhibited 
tremoring of their bodies when they were put on unstable surfaces (e.g. lid of the cage). Surviving mice 



WO-00/49145 



PCT/CA00/0017I 



- 35 - 

were smaller than their liitermates. Hemorrhage was frequently observed (e.g. at 3 weeks) in the brain of 
Hzf homozygous mutants. Therefore, in summary the following phenotype was observed: 

(a) The frequency of viable Hzf 7- mice decreased at the age of two weeks, although normal 
Mendelian ratios of Hzf mutants was observed at one week after birth. 
5 (b) Hemorrhage was frequently observed in the brain of Hzf -/- mice, 

(c) The majority of surviving Hzf mice were smaller than their littermates. presumably as a result of 
sustained hemorrhage during a period of rapid growth; they were able to lift their heads only to 
the right or left; and their bodies tremor when placed on unstable surfaces. 
The phenotype resembled the phenotype in the case of sustained brain damage resulting from a stroke in 
1 0 humans. 

The ultrastruciure of mature megakaryocytes from the Hzf-/- mouse was compared to mature 
megakaryocytes from the bone marrow of a Hzf +/+ mouse. The cytoplasm of megakaryocytes from the 
Hzf +/+ mouse showed platelet fields or tenitories that are clearly demarcated and many granules were 
observed. In contrast, the cytoplasm of megakaryocytes of the Hzf-/- mouse showed that the organization 
15 of the demarcation membrane system is aberrant, and granules are sparse. 

The present invention is not to be limited in scope by the specific embodiments described herein, 
since such embodiments are intended as but single illustrations of one aspect of the invention and any 
functionally equivalent embodiments are within the scope of this invention. Indeed, various modifications 

20 of the invention in addition to those shown and described herein will become apparent to those skilled in 

the art from the foregoing description and accompanying drawings. Such modifications are intended to fall 
within the scope of the appended claims. 

All publications, patents and patent applications referred to herein are incorporated by reference 
m their entirety to the same extent as if each individual publication, patent or patent application was 

25 specifically and individually indicated to be incorporated by reference in its entirety. All publications, 

patents and patent applications mentioned herein are incorporated herein by reference for the purpose of 
describing and disclosing the cell lines, vectors, methodologies etc. which are reported therein which might 
be used in connection with the invention. Nothing herein is to be construed as an admission that the 
invention is not entitled to antedate such disclosure by virtue of prior invention. 
^ It must be noted that as used herein and in the appended claims, the singular forms "a", "an", and 

"the" include plural reference unless the context clearly dictates otherwise. Thus, for example, reference 
to "a host cell" includes a plurality of such host cells, reference to the "antibody" is a reference to one or 
more antibodies and equivalents thereof known to those skilled in the art, and so forth. 
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Table 1 

Summary of lacZ expression within ES cell gene trap clones upon hema- 
topoietic differentiation^ 

Number of clones Days after induction 

0 5 10 

White clones 

1278 _ _ _ 

1 - + - 7H7 

1 - + + Hhl 



Blue clones 

2 + + + 4AllandHzf 

24 + + _ 

44 + _ _ 

1350 total 



'White clones, lacZ negative in undifferentiated ES cells; blue clones, 
lacZ positive in undifferentiated ES cells; +, lacZ positive during differ- 
entiation; /flcZ negative during differentiation. 
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WE CLAIM : 

I. An isolated Hzf nucleic acid molecule of at least 30 nucleotides which hybridizes to SEQ ID NO. 1. or the 
complement of SEQ ID NO. 1, under stringent hybridization conditions. 

5 2. An isolated Hhl nucleic acid molecule of at least 30 nucleotides which hybridizes to SEQ ID NO. 3 or the 

complement of SEQ ID N0.3, under stringent hybridization conditions. 

3. An isolated nucleic acid molecule which comprises: 

(i) a nucleic acid sequence encoding a polypeptide having substantial sequence identity with 
the amino acid sequence of SEQ. ID. N0.2 or SEQ. ID. NO 4.; 
1 0 (ii) nucleic acid sequences complementary to (i); 

(ili) a degenerate form of a nucleic acid sequence of (i); 

(iv) a nucleic acid sequence comprising at least 18 nucleotides and capable of hybridizing 
to a nucleic acid sequence in (i), (ii), or (iii); 

(v) a nucleic acid sequence encoding a truncation, an analog, an allelic or species variation 
^5 of a polypeptide comprising the amino acid sequence of SEQ. ID, N0.2, or SEQ. ID. 

NO 4; or 

( vi ) a fragment, or allelic or species variation of (i), (ii) or (iii). 

4. An isolated nucleic acid molecule which comprises: 

(a) a nucleic acid sequence having substantial sequence identity or sequence similarity 
2 0 with a nucleic acid sequence of SEQ. ID. NO. 1 or 3; 

(b) nucleic acid sequences complementary to (i), preferably complementary to the full 
nucleic acid sequence of SEQ, ID. NO. 1 or 3; 

(c) nucleic acid sequences differing from any of the nucleic acid sequences of (i) or (ii) 
in codon sequences due to the degeneracy of the genetic code; or 

25 (d) a fragment, or allelic or species variation of (i), (ii) or (iii). 

5. An isolated nucleic acid molecule which encodes a protein which binds an antibody of a Hzf or Hhl 
polypeptide. 

6. A regulatory sequence of an isolated nucleic acid molecule as claimed in any of the preceding claims 
fused to a nucleic acid which encodes a heterologous protein. 

30 7. A vector comprising a nucleic acid molecule of any of the preceding claims. 

8. A host cell comprising a nucleic acid molecule of any of the preceding claims. 

9. An isolated Hzf polypeptide comprising an amino acid sequence of SEQ. ID. NO. 2. 

10. An isolated Hhl polypeptide comprising an amino acid sequence of SEQ. ID. NO. 4. 

II. An isolated polypeptide having at least 50% amino acid sequence identity to an amino acid sequence 
35 ofSEQ. ID.NO. 2or4. 

12. A method for preparing a polypeptide as claimed in claim 9 or 10 comprising: 

(a) transferring a vector as claimed in claim 7 into a host cell; 

(b) selecting transformed host cells from untransformed host cells; 
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(c) culturing a selected transformed host cell under conditions which allow expression of the 
protein; and 

(d) isolating the polypeptide. 

13. A polypeptide prepared in accordance with the method of claim 12. 
5 14. An antibody having specificity against an epitope of a polypeptide as claimed in claim 9 or 10. 

15. An antibody as claimed in claim 14 labeled with a detectable substance and used to detect the 
polypeptide in biological samples, tissues, and cells. 

16. A probe comprising a sequence encoding a polypeptide as claimed in claim 9 or 10, or a part thereof. 

17. A method of diagnosing and monitoring conditions mediated by a polypeptide as claimed in claim 9 
10 or 10 by determining the presence of a nucleic acid molecule as claimed in any of the preceding claims 

or a polypeptide as claimed in any of the preceding claims. 

18. A method as claimed in claim 17 wherein the condition is a disorder of the hematopoietic system. 

19. A method for identifying a substance which associates with a polypeptide as claimed in claim 9 or 10 
comprising (a) reacting the polypeptide with at least one substance which potentially can associate with 

1 5 the polypeptide, under conditions which permit the association between the substance and pol>peptide, 

and (b) removing or detecting polypeptide associated with the substance, wherein detection of 
associated polypeptide and substance indicates the substance associates with the polypeptide. 

20. A method for evaluating a compound for its ability to modulate the biological activity of a polypeptide 
as claimed in claim 9 or 10 comprising providing a known concentration of the polypeptide with a 

2 0 substance which associates with the polypeptide and a test compound under conditions which permit 

the formation of complexes between the substance and polypeptide, and removing and/or detecting 
complexes. 

21. A method for detecting a nucleic acid molecule encoding a polypeptide comprising an amino acid 
sequence of SEQ. ID. NO. 2 or 4 in a biological sample comprising the steps of: 

25 (a) hybridizing a nucleic acid molecule of claim 4 to nucleic acids of the biological 

sample, thereby forming a hybridization complex; and 
(b) detecting the hybridization complex wherein the presence of the hybridization 
complex correlates with the presence of a nucleic acid molecule encoding the 
polypeptide in the biological sample. 
30 22. A method as claimed in claim 2 1 wherein nucleic acids of the biological sample are amplified by the 

polymerase chain reaction prior to the hybridizing step. 
23. A method for treating a condition mediated by a polypeptide as claimed in claim 9 or 10 comprising 
administering an effective amount of an antibody as claimed in claim 14 or a substance or compound 
identified in accordance with a method claimed in claim 19 or claim 20. 
35 24. A method as claimed in claim 23 wherein the condition is a disorder of the hematopoietic system. 

25. A composition comprising one or more of a nucleic acid molecule or polypeptide claimed in any of 
the preceding claims, or a substance or compound identified using a method as claimed in any of the 
preceding claims, and a pharmaceutically acceptable carrier, excipient or diluent. 
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26. Use of one or more of a nucleic acid molecule or polypeptide claimed in any of the preceding claims, 
or a substance or compound identified using a method as claimed in any of the preceding claims in the 
preparation of a pharmaceutical composition for treating a condition mediated by a polypeptide as 
claimed in claim 9 or 10. 

27. A transgenic non-human mammal wherein a nucleic acid molecule as claimed in claim 3, or a portion 
thereof is deleted in the mammal *s germ cells and somatic cells. 

28. A transgenic non-human mammal all of whose germ cells and somatic cells contain a recombinant 
expression vector that inactivates or alters a gene encoding a Hzf polypeptide as claimed in claim 4 
resulting in a Hzf associated pathology. 

29. A transgenic non-human mammal which doe not express a Hzf polypeptide as claimed in claim 9 
resulting in a Hzf associated pathology. 

30. A transgenic animal assay system which provides a model system for testing for an agent that reduces 
or inhibits an Hzf associated pathology 

(a) administering the agent to a transgenic non-human animal as claimed in claim 29; and 

(b) determining whether said agent reduces or inhibits an Hzf associated pathology in the transgenic 
mouse relative to a transgenic mouse of step (a) which has not been administered the agent. 
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FIGURE 1 



Hhl Hzf 




WOi)0/49145 



2/11 

FIGURE 2A-H 
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FIGURE SB 



► 12G10-5 ^ ► 12G10-6 

TaAACOGGACQGGCCCaOCCGCAOAOCTCTQaOCQGCOGQTOGQACTCOOGCGTCTGACA 

aAGTTaTTQTTCCGGCACGCCGCGaAOACGTaAG 3TCCAGGTCCCGAAACCAAAGAAGA . . 



En-2 / LacZ 
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► 4H5-1 ^ '^HS-S ^ 

TGCAOAACGOCCCCCOGQACAQAACQACGCGGAACCCCGGaAGCCAGOGTCCCCAOCATO 

ATCCTCG 3TCCCAGGTCCCGAAAACCAAAGA .... 

En-2/Lac2 ^ 
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FIGURE 6A 



ctcgggcgtctgacagagttgttgttccggcacgccgcggagacgtgagWgttccaggt 60 

ggatgtgggcagagaggtttgcagaactgaaatggaggtcggagcttcgttccagaaggt 120 

tagtgggtcatctgattctgtggccacactgaacagtgaagaatttgttttggtttctca 180 

gcacacagatgccacttctataaaggatgatgggaagccacagctgaagatagcttccaa 240 

tggtgatgagcagttggaaaaagccatggaagagattttgagagattccgagaaaggaca 300 

meeilrdsekgq 

aagcggtctacctgttgattgccaaggatccagtgagatttcagactgtccttttggaga 360 
sglpvdcqgsseisdcpfgd 

tgtgccggccagccaaacaactaagccgcctctccagttaattttggatccatctaatac 420 
vpasqttkppl qli ldpsnt 

agaaatttccacacccagaccatcttctccaagcagatttcctgaagaagacagtgttct 480 
eistprpsspsrfpeedsvl 

ctttaacaagctgacatacttaggatgtatgaaggtttcttcccacgcagtgaagtggag 540 
fnkltylgcmkvsshavkwr 

gctttacggccatgccaccatgagagcttccagtcagtacccctttgctgttactctgta 600 
lyghatmrassqypfavtly 

tgtgcccaatgttccagaaggatctgtgagaatcatagaccagtcaagcaatgtggagat 660 
vpnvpegsvriidqssnvei 

agcatcttttccaatttataaagtgcttttctgtgcacgtgggcatgatgagacagccga 720 
asfpiykvlfcarghdetae 

gagcaattgctttgcatttacagagagttctcatggctcagaagaatttcagatacatgt . 780 
sncfaftesshgseefqihv 

tttctcctgtgaaattaaagaggcagtaagcagaattttatatagtttctgcactgcatt 840 
fsceikeavsrilysfctaf 

caaacgttcttccagacaagtgtctgatgttaaagactcagtcatcccgactcccgacag 900 
krssrqvsdvkdsviptpds 

tgatgtgtttaccttcagtgtctccttggaggtcaaagaagatgatggaaaaggaaactt 960 
dvftfsvslevkeddgkgnf 

tagtcccgtgcctaaggatagagataaattttatatcaaaataaagcaaggaatagagaa 1020 
spvpkdrdkfyikikqgiek 

gaaggttgtgattacagttcagcaactgtctaacaaagaattagctattgagagatgttt 1080 
kvvitvqqlsnkelaiercf 

tggaatgttattaagcccaggtcgaaacgtgaagaacagtgacatgcatttactggatat 1140 
gmllspgrnvknsdmhlldm 

ggagtcccggaaagagctatgatgggagagcttacgtcatcacgggcatgtggaacccca 1200 

E S R K E L * 



ACGCACCAATATTTCTGGCTCTTAATGAAGAAACCCTGTCTC AAAAAAAACCAAAAAA 1 258 



wo 00/49145 



PCT/CAOQ/00171 



9/11 

FIGURE 6B 



CCGAACCCCGGGAGCCAGGGTCCCCACCATGATCCTcJiCAGCCTCAGCCGCGCCGGGCC 60 

MILGSLSRAGP 

CCTCrcTCTGCTCCGGCAGCCCCCCATCATGCAGCCACCGATGG 

GCCCrrCCCACTAGAGrcAGCCCCAACCCTGGGCCTCTTCAGCAACTACA , gQ 



P ^ P L E P A P T L G L F S Ty S T M D 

CCCTCTACAGJ^CTCTGCnXTTCCCACAC™ 
PVOKAVLSHTFGGPLLKTKR 

GCCACTCATTT CCTGTMTGTCTCTCAGATCC^ 300 

GCACTACAAGGGTAATrGrrA TGCCCGAAGACTCAAAGGCATCGAAGCTCCC^ 360 
" y K G N R ARRVKGIEAAKTR 

AGGCAGGGAGCCTAGTCTCCGGGAATCAGGAGATCCAGCTCCAGCAGGCA 420 
^«EPSVRESGDPAPAGSIPP 

GAGTGGGGATGGTCTAGCCCCTCGTCCACT^ 480 

AGGATCCCCAGAGAAACAGCCTGGCTCCCCATCCCCTCCCACTGTrCC^ 
GSPEKOPGSPSPPSVPESGQ 

GGCTCTAACXrj^GGGTGAAGGGGGAACTTCAGTCCCAG^ 600 
.»-VTKGEGGTSVPASLPGGSK 

GGAAGAGGAGGAGJU^GGCTAAGCGTCTGCTCTA CTGTGCACTGTGCAAr^^^^ 660 
eEEEKAKRLLY | C A L C K V A V N 

CTCCCTGTCCCAGCTTGAGGCACATAACAAAGGTACTAAnr.r .^n.n..>nn^ ^20 

— — ^sqleah nkgtkh] k T I l e a 

CGGAAGTGGGCrrGGGAGCCATCAAAGCTTACCCTCGGGTGGGGCCTCCAATT 780 
^I't'LGAIKAYPRVGPPlLGN 

CCAGAGGCTCCTGCCCAGGGACGGAACCrrCCA CTGTGAGATCTGCAATCTCAACnTT-^^ 840 
QRLLPRDGTFH | C E I C N V K V^IT 

T^CGGAGGTCCAGrTGAAACAGCACATCTCCAGCACGAr^r^P rv..n.^..^^^^^^ 900 

— ^^Q^i^Qh issrrhI r d g V a g 

GAAGrcCAACCCTCTAmAGCCGGCACAAGAAGCCTAGGGGCCCTG 
**^''^^^SRHKKPRGAAELAG 

C^CGCTGACTTTCTCAAAGGAGCTGCCCAA^^ 1020 
CCTAOCGGTGGCTGCGffTCATGGCCGCTGCAGC^ 1080 

^avaavmaaaagsplslrpa 

TCCAGCTGCACCTCTTCTGCAGGGACCACCGATCACACACCCTCTA^^^ , 
t-AAPLLOGPPlTHPLLHPAP 

AGG ACCCATCCGAACTTCGCACGGACCCATCCTCTTCrCCCCCTACTC^^ 1 200 

^fiRTAHCPILFSPY* 



TCCAS^rn^^'^'^'^^'^'^^^'^^^^^^^^C^C^CCGAG^^^ '"^ 
I^^?S^^°°^'^^°°^'^'^°^^°°*2A^GGGGCTTCAGAACGGGGGGAACACCC- 

S^^AS^^^°°''*'''^''^^^^^^^^^°*===A'rc'I^CATCCAACCCTrCC 



!GT 1320 

:accccaga 1 380 
:ac 1440 



.740 



2300 



wo 00/49145 



10/11 



PCT/CA0O/0Ol7i 



FIGURE 7 



A 
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FIGURE 8 
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SEQUENCE LISTING 

<110> Mount Sinai Hospital 

<120> NOVEL HEMATOPOIETIC GENES AND POLYPEPTIDES 

<130> P170PCT9 

<140> not yet issued 
<141> 2000-02-18 

<150> US 60/120,972 
<151> 1999-02-19 

<160> 14 

<170> Patentin Ver. 2.0 

<210> 1 
<211> 2297 
<212> DNA 

<213> Murinae gen. sp . 
<400> 1 

cggaaccccg ggagccaggg tccccagcat gatcctcggc agcctgagcc gggcggggcc 60 
cctccctctg ctccggcagc cccccatcat gcagccaccg atggacctca agcagatcct 120 
gcccttccca ctagagccag ccccaaccct gggcctcttc agcaactaca gcacaatgga 180 
ccctgtacag aaagctgtgc tctcccacac ttttggagga cccttgctca agaccaagcg 240 

gccagtcatt tcctgtaatg tctgtcagat ccgcttcaat tctcagagcc aggctgaggc 300 
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gcactacaag 


ggtaatcgcc 


atgcccgaag 


agtcaaaggc 


atcgaagctg 


ccaaaacccg 


360 


aggcagggag 


cctagtgtcc 


gggaatcagg 


agatccagct 


ccagcaggca 


gcatccctcc 


420 


gagtggggat 


ggtgtagccc 


ctcgtccagt 


ttccatggag 


aatggcctgg 


gtccagctcc 


480 


aggatcccca 


gagaaacagc 


ctggctcccc 


atcccctccc 


agtgttccag 


agtcgggaca 


540 


gggtgtaacc 


aagggtgaag 


ggggaacttc 


agtcccagct 


tccctgcctg 


ggggtagcaa 


600 


ggaagaggag 


gagaaggcta 


agcgtctgct 


ctactgtgca 


ctgtgcaagg 


tggctgtgaa 


660 


ctccctgtcc 


cagcttgagg 


cacataacaa 


aggtactaag 


cacaagacaa 


ttttggaggc 


720 


cggaagtggg 


ctgggagcca 


tcaaagctta 


ccctcgggtg 


gggccaattc 


tggggaacca 


780 


gaggctcctg 


cccagggacg 


gaaccttcca 


ctgtgagatc 


tgcaatgtca 


aggtcaattc 


840 


ggaggtccag 


ctgaaacagc 


acatctccag 


caggaggcac 


cgagatggcg 


tggctgggaa 


900 


gcccaaccct 


ctactgagcc 


ggcacaagaa 


gcctaggggc 


gctgcagagc 


tggcgggcac 


960 


gctgactttc 


tcaaaggagc 


tgcccaagtc 


cctggccggt 


ggcctgctcc 


ccagccccct 


1020 


^9C99tggct 


gcggtgatgg 


ccgctgcagc 


aggatctccg 


ctgtccctgc 


gtccagctcc 


1080 


agctgcacct 


cttctgcagg 


gaccaccgat 


cacacaccct 


ctactccacc 


ctgccccagg 


1140 


acccatccga 


actgcgcacg 


gacccatcct 


cttctccccc 


tactgacctc 


aaccctgaac 


1200 


ccctcccatt 


gaatccccca 


cctccagccg 


ggacccaggt 


ctccgggctc 


ccagcccgcc 


1260 


cctcttcccg 


gcgctccctg 


aatggtctct 


ccttcccccc 


accccgaggt 


acggggttcc 


1320 


aggaaagggg 


aggggtagcg 


ggggaggggg 


gcttcagaag 


ggggggaaca 


ccccagatct 


1380 


caggaacccc 


gccccctgcc 


cttccctctc 


ccctagaaaa 


gaggggggcc 


gtctcacccc 


1440 


cgagccccct 


cggagacacc 


ccctcccaaa 


agccatgtcc 


atccaaccct 


tcccctccaa 


1500 




aaacggggx: c 


^ ^ ^ a a ^ ^ ^ 

cacaagccaw 


ggtcggggcc 




acauggau t.u 




tcttggcaat 


aagcggactc 


tgggactccg 


gctcccctac 


cccaaactaa 


agcgcttccg 


1620 






caggaggagg 


ggaaca^^cg 


ggatccuygg 




xo a u 


gcactttggt 


tttaccgcct 


gcaacctcac 


tgtgcccgcc 


ccgcaccatg 


ccccagcccc 


1740 


gggtccagtc 


gggcccatcg 


cagggggcag 


cgcttggggg 


catctcaggc 


acttgggtgg 


1800 


gaccaaggag 


atgccctcat 


agacccttcc 


ctcgccttct 


tcctccccgg 


tccgggttcc 


1860 


attcttttca 


ccagcaccca 


tcgcccaagg 


ggtaccaagg 


ggggcaaggg 


gtgtccagtc 


1920 


caagcccacc 


cccgcctcgc 


cttccgcaaa 


actgtgagca 


aaaagcaata 


gaagcctcgc 


1980 
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ccccgccccc ggcccgtccc tttccgcagg atttgccgtc tgtagcctcc ccattccagt 2 040 

tcctagacct catggctgcc ccctcccgtg acacctccac tgcacaactc gggcgggggc 2100 

gtgaccccca acccgtccct tctgtggttt ccgtgtggtc agcccttcca gccgccgacc 216 0 

cgggatgggg actggccccc actggccctc ccctccccat ggactctttc tgcttgacaa 2220 

tgtagcaaac cccctccctc tttctctctc tttgacaata aagtctggat ttgttctgcc 2280 

ctctcgccaa ggaaaaa 2297 

<210> 2 
<211> 385 
<212> PRT 

<213> Murinae gen. sp. 
<400> 2 

Met lie Leu Gly Ser Leu Ser Arg Ala Gly Pro Leu Pro Leu Leu Arg 
15 10 15 

Gin Pro Pro lie Met Gin Pro Pro Met Asp Leu Lys Gin lie Leu Pro 
20 25 30 

Phe Pro Leu Glu Pro Ala Pro Thr Leu Gly Leu Phe Ser Asn Tyr Ser 
35 40 45 

Thr Met Asp Pro Val Gin Lys Ala Val Leu Ser His Thr Phe Gly Gly 
50 55 60 



Pro Leu Leu Lys Thr Lys Arg Pro Val lie Ser Cys Asn Val Cys Gin 

65 70 75 80 

lie Arg Phe Asn Ser Gin Ser Gin Ala Glu Ala His Tyr Lys Gly Asn 
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85 90 95 

Arg His Ala Arg Arg Val Lys Gly He Glu Ala Ala Lys Thr Arg Gly 
100 105 110 

Arg Glu Pro Ser Val Arg Glu Ser Gly Asp Pro Ala Pro Ala Gly Ser 
115 120 125 

He Pro Pro Ser Gly Asp Gly Val Ala Pro Arg Pro Val Ser Met Glu 
130 135 140 

Asn Gly Leu Gly Pro Ala Pro Gly Ser Pro Glu Lys Gin Pro Gly Ser 
145 150 155 160 

Pro Ser Pro Pro Ser Val Pro Glu Ser Gly Gin Gly Val Thr Lys Gly 
165 170 175 

Glu Gly Gly Thr Ser Val Pro Ala Ser Leu Pro Gly Gly Ser Lys Glu 
180 185 190 

Glu Glu Glu Lys Ala Lys Arg Leu Leu Tyr Cys Ala Leu Cys Lys Val 
195 200 205 

Ala Val Asn Ser Leu Ser Gin Leu Glu Ala His Asn Lys Gly Thr Lys 
210 215 220 

His Lys Thr He Leu Glu Ala Gly Ser Gly Leu Gly Ala He Lys Ala 
225 230 235 240 
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Tyr Pro Arg Val Gly Pro lie Leu Gly Asn Gin Arg Leu Leu Pro Arg 
245 250 255 

Asp Gly Thr Phe His Cys Glu lie Cys Asn Val Lys Val Asn Ser Glu 
260 265 270 

Val Gin Leu Lys Gin His lie Ser Ser Arg Arg His Arg Asp Gly Val 
275 280 285 

Ala Gly Lys Pro Asn Pro Leu Leu Ser Arg His Lys Lys Pro Arg Gly 
290 295 300 

Ala Ala Glu Leu Ala Gly Thr Leu Thr Phe Ser Lys Glu Leu Pro Lys 
305 310 315 320 

Ser Leu Ala Gly Gly Leu Leu Pro Ser Pro Leu Ala Val Ala Ala Val 
325 330 335 

Met Ala Ala Ala Ala Gly Ser Pro Leu Ser Leu Arg Pro Ala Pro Ala 

340 345 350 

Ala Pro Leu Leu Gin Gly Pro Pro He Thr His Pro Leu Leu His Pro 
355 360 365 

Ala Pro Gly Pro He Arg Thr Ala His Gly Pro He Leu Phe Ser Pro 
370 375 380 



Tyr 
385 
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<210> 3 
<211> 1258 
<212> DNA 

<213> Murinae gen. sp. 



<400> 3 



ctcgggcgtc 


tgacagagtt 


gttgttccgg 


cacgccgcgg 


agacgtgagt 


tgttccaggt 


60 


ggatgtgggc 


agagaggttt 


gcagaactga 


aatggaggtc 


ggagcttcgt 


tccagaaggt 


120 


tagtgggtca 


tctgattctg 


tggccacact 


gaacagtgaa 


gaatttgttt 


tggtttctca 


180 


gcacacagat 


gccacttcta 


taaaggatga 


tgggaagcca 


cagctgaaga 


tagcttccaa 


240 


tggtgatgag 


cagttggaaa 


aagccatgga 


agagattttg 


agagattccg 


agaaaggaca 


300 


aagcggtcta 


cctgttgatt 


gccaaggatc 


cagtgagatt 


tcagactgtc 


cttttggaga 


360 


tgtgccggcc 


agccaaacaa 


ctaagccgcc 


tctccagtta 


attttggatc 


catctaatac 


420 


agaaatttcc 


acacccagac 


catcttctcc 


aagcagattt 


cctgaagaag 


acagtgttct 


480 




w CL\a>Ci CL ^ l_ 


1" a rrcTJ? i" crt" a t" 








540 


gctttacggc 


catgccacca 


tgagagcttc 


cagtcagtac 


ccctttgctg 


ttactctgta 


600 


tgtgcccaat 


gttccagaag 


gatctgtgag 


aatcatagac 


cagtcaagca 


atgtggagat 


660 


agcatctttt 


ccaatttata 


aagtgctttt 


ctgtgcacgt 


gggcatgatg 


agacagccga 


720 


gagcaattgc 


tttgcattta 


cagagagttc 


tcatggctca 


gaagaatttc 


agatacatgt 


780 


tttctcctgt 


gaaattaaag 


aggcagtaag 


cagaatttta 


tatagtttct 


gcactgcatt 


840 


caaacgttct 


tccagacaag 


tgtctgatgt 


taaagactca 


gtcatcccga 


ctcccgacag 


900 


tgatgtgttt 


accttcagtg 


tctccttgga 


ggtcaaagaa 


gatgatggaa 


aaggaaactt 


960 


tagtcccgtg 


cctaaggata 


gagataaatt 


ttatatcaaa 


ataaagcaag 


gaatagagaa 


1020 


gaaggttgtg 


attacagttc 


agcaactgtc 


taacaaagaa 


ttagctattg 


agagatgttt 


1080 


tggaatgtta 


ttaagcccag 


gtcgaaacgt 


gaagaacagt 


gacatgcatt 


tactggatat 


1140 


ggagtcccgg 


aaagagctat 


gatgggagag 


cttacgtcat 


cacgggcatg 


tggaacccca 


1200 


acgcaccaat 


atttctggct 


cttaatgaag 


aaaccctgtc 


tcaaaaaaaa 


ccaaaaaa 


1258 
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<210> 4 
<211> 298 
<212> PRT 

<213> Murinae gen. sp. 
<400> 4 

Met Glu Glu lie Leu Arg Asp Ser Glu Lys Gly Gin Ser Gly Leu Pro 
15 10 15 

Val Asp Cys Gin Gly Ser Ser Glu lie Ser Asp Cys Pro Phe Gly Asp 
20 25 30 

Val Pro Ala Ser Gin Thr Thr Lys Pro Pro Leu Gin Leu lie Leu Asp 
35 40 45 

Pro Ser Asn Thr Glu lie Ser Thr Pro Arg Pro Ser Ser Pro Ser Arg 
50 55 60 

Phe Pro Glu Glu Asp Ser Val Leu Phe Asn Lys Leu Thr Tyr Leu Gly 
65 70 75 80 

Cys Met Lys Val Ser Ser His Ala Val Lys Trp Arg Leu Tyr Gly His 
85 90 95 

Ala Thr Met Arg Ala Ser Ser Gin Tyr Pro Phe Ala Val Thr Leu Tyr 
100 105 110 

Val Pro Asn Val Pro Glu Gly Ser Val Arg lie lie Asp Gin Ser Ser 
115 120 125 
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Asn Val Glu He Ala Ser Phe Pro He Tyr Lys Val Leu Phe Cys Ala 
130 135 140 

Arg Gly His Asp Glu Thr Ala Glu Ser Asn Cys Phe Ala Phe Thr Glu 
145 150 155 160 

Ser Ser His Gly Ser Glu Glu Phe Gin He His Val Phe Ser Cys Glu 
165 170 175 

He Lys Glu Ala Val Ser Arg He Leu Tyr Ser Phe Cys Thr Ala Phe 
180 185 190 

Lys Arg Ser Ser Arg Gin Val Ser Asp Val Lys Asp Ser Val He Pro 
195 200 205 

Thr Pro Asp Ser Asp Val Phe Thr Phe Ser Val Ser Leu Glu Val Lys 
210 215 220 

Glu Asp Asp Gly Lys Gly Asn Phe Ser Pro Val Pro Lys Asp Arg Asp 
225 230 235 240 

Lys Phe Tyr He Lys He Lys Gin Gly He Glu Lys Lys Val Val He 
245 250 255 

Thr Val Gin Gin Leu Ser Asn Lys Glu Leu Ala He Glu Arg Cys Phe 
260 265 270 

Gly Met Leu Leu Ser Pro Gly Arg Asn Val Lys Asn Ser Asp Met His 
275 280 285 
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Leu Leu Asp Met Glu Ser Arg Lys Glu Leu 



290 



295 



<210> 5 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 5 

gcagagctct gggcggcggg t 21 
<210> 6 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 



<400> 6 



ctcgggcgtc tgacagagtt g 



21 



<210> 7 



<211> 18 



<212> DNA 



<213> Artificial Sequence 
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<220> 



<223> Description of Artificial Sequence : primer 



<400> 7 



cccgggacag aacgacgc 



18 



<210> 8 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 

<400> 8 

cggaccccgg gagccagg 18 

<210> 9 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 



<400> 9 



attacgccag ctggcgaaag g 



21 
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<210> 10 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 10 

gattggtggc gacgactcct g 21 

<210> 11 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 11 

tatcttcagc tgtggcttcc c 21 

<210> 12 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
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<400> 12 

ccgtcgactc tggcgccgct gctctgtcag 

<210> 13 
<2ll> 19 
<212> DNA 

<213> Artificial Sequence 
<:220> 

<223> Description of Artificial Sequence : primer 
<400> 13 

cggaaccccg ggagccagg 

<210> 14 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : seql4 
<400> 14 

tctggggatc ctggagctgg ac 
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